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NOVEL THERAPEUTIC TARGET FOR TREATING VASCULAR 
DISEASES, DYSLIPIDEMIAS AND RELATED DISORDERS 



10 No. 60/391,878, filed 27 June 2002, the disclosure of which is hereby 
incorporated by reference in its entirety. 



The present invention relates to the discovery of gene95, a novel 
therapeutic target for disease diagnosis and treatment, especially diagnosis 
and treatment of cardiovascular diseases, such as dyslipidemia and low HDL 
20 diseases, including hypoalphalipoproteinemia, as well as therapeutic agents 
and methods of screening and identifying therapeutic agents useful in 
preventing, treating or otherwise ameliorating such disorders. 



25 BACKGROUND OF THE INVENTION 

Epidemiological studies have consistently demonstrated that plasma 
high density lipoprotein cholesterol (HDL-C) concentration is inversely related 
to the incidence of vascular disease, particularly cardiovascular disease 
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(CVD) andfj^qnary artery disease (CAD). HDL^fcevels are a strongly 
graded and independent cardiovascular risk factor. Protective effects of an 
elevated HDL-C persist until 80 years of age. A low HDL-C is associated with 
an increased CAD risk even with normal (<5.2 mmol/l) total plasma 
5 cholesterol levels. Even in the face of other dyslipidemias or secondary 
factors, HDL-C levels are important predictors of CAD. Low HDL cholesterol 
(in severe cases called hypoalphalipoproteinemia), is also implicated in 
cerebrovascular disease, coronary restenosis, and peripheral vascular 
disease. 

10 

Pharmacological intervention of low HDL-C levels has so far proven 
unsatisfactory. Currently, there are no FDA approved therapeutic agents 
which modulate HDL levels in a direct significant fashion. Current research 
strategies aimed towards modulating HDL involve increasing production of 

15 ApoA1, promoting the rate of reverse cholesterol transport (RCT) and 
decreasing catabolism of HDL. For example, increasing ApoA1 levels may be 
possible via small molecular upregulation of transcription or by infusion of 
ApoA1 protein. Alternatively, it may be possible to upregulate RCT by 
developing ABCA1 agonists and increasing cholesterol efflux from peripheral 

20 tissues (see PCT publication WO 00/55318, incorporated herein by 
reference). Catabolism of HDL is regulated by a number of enzymes, 
including Cholesteryl Ester Transfer Protein (CETP), which may also be a 
suitable therapeutic target for HDL regulation. 

Identification of additional genes and corresponding proteins which act 
25 to modulate HDL levels in humans would allow rational choice of which 
therapeutic approach to pursue in terms of drug development The present 
invention relates to the discovery of a new target for therapeutic intervention 
in the treatment of vascular diseases and dyslipidemias and use of said target 
to screen for and identify therapeutic agents useful in alleviating low HDL 
30 levels and the symptoms of attendant disorders. 
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The present invention provides the complete gene95 cDNA nucleotide 
sequence (SEQ ID NO: 3) and gene95 amino acid sequence (SEQ ID NO: 4) 
from humans. The invention further identifies the role and function of gene95 
as a key regulator of HDL cholesterol levels by identification of mutant forms 
of the gene which are associated with inherited low HDL disorders in humans. 

In one aspect, the present invention provides the purified nucleic acid 
sequence of gene95 and fragments thereof, including purified and semi- 
purified forms thereof, as well as recombinant cell lines, viruses or cell 
extracts containing gene95 nucleotide constructs. 

In another aspect, the present invention provides the purified gene95 
protein and fragments thereof, including purified and semi-purified forms 
thereof, as well as cells or cell extracts containing gene95 biological activity, 
and anti-gene95 antibodies, including methods of making such polypeptides. 

In one aspect, the present invention provides a method for determining 
whether a candidate compound modulates HDL levels and/or treating a 
vascular disease or dyslipidemia. The method comprises (a) determining a 
biological activity of the gene95 gene or protein; (b) contacting a source of this 
biological activity with a candidate compound under conditions promoting this 
biological activity; and (c) determining a change in the biological activity of 
said gene or protein as a result of said contacting, wherein the change in 
biological activity identifies the candidate compound, or its an analog of the 
candidate compound, as a compound that modulates said biological activity 
and is useful in regulating HDL levels and/or treating a vascular disease, 
dyslipidemia or a related disorder. Other types of assays for Gene95 activity 
are also specifically contemplated by the present invention. 
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In ar^^r aspect, the invention provides a n^j^d for computationally 
identifying a compound for modulating HDL levels and/or treating a vascular 
disease or dyslipidemia. The method involves (a) determining the active site 
of the gene95 protein selected (for example, through X-Ray crystallography or 
5 other techniques); and (b) through computational modeling, identifying a 
compound which interacts with the active site, thereby identifying a 
compound, or an analog thereof, as a compound which is useful for 
modulating HDL levels and/or treating a vascular disease, dyslipidemia or a 
related disorder. 



10 



15 



In another aspect, the invention provides a method for modulating HDL 
levels and/or treating a vascular disease, dyslipidemia or a related disorder, 
comprising administering to a person in need thereof, a compound which 
modulates the activity of gene 95 gene or protein. 



In a further aspect, the invention provides tools for diagnosis of 
vascular disease and dyslipidemias and/or pharmacogenomics of therapeutic 
agents for treating vascular disease and dyslipidemias, comprising 
identification of mutations in the gene95 gene or protein, for example the 
20 mutations identified in SEQ ID Nos. 6 - 9 or in Table 1 . 



BRIEF DESCRIPTION OF THE DRAWINGS 

25 

Figure 1 shows the pedigree of family BC-1 1 M. 
Figure 2 shows the pedigree of family NL-003. 

30 

Figure 3 shows the pedigree of family NL-120. 
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Figure 1 ; 



astrates the expected 10 exon con: 




of gene95 cDNA. 
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Incomplete constructs assembled from known fragments of gene95 are also 
illustrated, including other possible transcripts employing E6a or E9a. 

Figures 5A-5G provide the sequence of exons and introns and the 
amino acid translation of gene95. Legend: Lower case letters = intron regions 
not forming part of mature mRNA or cDNA. = intron regions available from 
public database; not included herein. Capital Letters = Nucleotide base or 
Amino Acid (as indicated by context; with standard designations used). "*" = a 
stop codon. +1, +2, +3: Frameshift for translation of nucleic acid. Short open 
reading frames before and after the correct amino acid sequence are 
indicated for completeness, but are not expected to form part of the gene95 
protein. Here, exon E1 is part of SEQ ID NO: 18, exon E2 is part of SEQ ID 
NO: 19, exon E3 is part of SEQ ID NO: 20, exon E4 is part of SEQ ID NO: 21, 
exon E5 is part of SEQ ID NO: 22, exon E6 is part of SEQ ID NO: 23, exon E7 
is part of SEQ ID NO: 24, exon E8 is part of SEQ ID NO: 25, exon E9 is part 
of SEQ ID NO: 26, exon E10 is part of SEQ ID NO: 27, exon E6a is part of 
SEQ ID NO: 28, and exon E9a is part of SEQ ID NO: 29. Here, exon 6a = 
Alternative Exon 6 ("Exon 6 alt") (truncates protein) while exon 9a = 
Alternative exon 9 ("Exon 9 alt") (longer, but truncates). In addition, ctttga 
(SEQ ID NO: 30) shows the start of a new intronic sequence. 

Figure 6 illustrates the alignment of human (SEQ ID NO: 4) and 
mouse (SEQ ID NO: 5) amino acid sequences of gene95. 

Figure 7 shows the alignments of the amino acid sequence of mutation 
1 (SEQ ID NO: 7) and mutation 2 (SEQ ID NO: 16) with the wild-type amino 
acid sequence (SEQ ID NO: 4) of human gene95. 

Figure 8 illustrates the predicted exon assembly of gene95, with 
locations of mutations and polymorphisms identified according to the 
invention. 
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Figure 9 shows a Northern blot with a Gene 95 probe illustrating the 
results of Example 3. 



10 As used herein, the following terms have their indicated meanings 

unless expressly stated otherwise. 

As used herein, the term "correspond" in the sense of "a gene 
corresponding to gene95 D , means that the gene has the indicated nucleotide 
15 sequence or that it encodes substantially the same RNA as would be 
encoded by the indicated sequence, the term "substantially" meaning at least 
about 50% identical over a sequence encoding a protein with gene95 
biological activity; more preferably at least about 60%, 70%, 80%, 90% or 
higher, and as defined elsewhere herein, including splice variants thereof. 



"Corresponding to gene95" also includes homologs or orthologs of 
gene95 from another organism, and sequence variants thereof having at least 
about 50% identity, at either the nucleotide or amino acid level, and more 
preferably at least about 60%, 70%, 80%, 90% or higher homology. 

25 Preferably such gene is from a eukaryote, more preferably a mammal, more 
preferably a rodent or human. The human genes and proteins listed in this 
specification are representative of a wide variety of homologs, orthologs and 
analogs that may be obtained from other species. These homologs, orthologs 
and/or analogs, and reasonable variants thereof, can be used in the 

30 compositions and methods of this invention. For example, a screening assay 
which uses the mouse homolog of a listed human gene may also be used in a 
screening assay to identify potential therapeutic agents for treating humans. 
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As such, the^fcent invention relates to the use of the^isoforms, homologs 
from other species, and analogous sequences which, when translated, retain 
the biological activity of the isoform, as determined on a region by region 
basis. Such analogous sequences share preferably greater than 50% identity 
5 with SEQ ID NO: 3 or SEQ ID NO: 4, more preferably 60%, 70%, 80% and 
most preferably at least about 90% identity with the isoforms set out herein. 
All such sequences may still be useful in practicing the methods of the 
invention provided that they retain sufficient therapeutic target biological 
activity so as to determine the effects of test compounds in regulating such 
10 activity. Such understanding is available to those skilled in the art based on 
the teachings disclosed herein. 

Further in accordance with the present invention, the term "percent 
identity" or "percent identical," when referring to a sequence, means that a 
15 sequence is compared to a claimed or described sequence after alignment of 
the sequence to be compared (the "Compared Sequence") with the described or 
claimed sequence (the "Reference Sequence"). The Percent Identity is then 
determined according to the following formula: 

20 Percent Identity = 100[1-(C/R)] 

wherein C is the number of differences between the Reference Sequence and 
the Compared Sequence over the length of alignment between the Reference 
Sequence and the Compared Sequence wherein (i) each base or amino acid in 

25 the Reference Sequence that does not have a corresponding aligned base or 
amino add in the Compared Sequence and (ii) each gap in the Reference 
Sequence and (iii) each aligned base or amino acid in the Reference Sequence 
that is different from an aligned base or amino acid in the Compared Sequence, 
constitutes a difference; and R is the number of bases or amino acids in the 

30 Reference Sequence over the length of the alignment with the Compared 
Sequence with any gap created in the Reference Sequence also being counted 
as a base or amino acid. 
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If ar^pgnment exists between the Comp^Jf Sequence and the 
Reference Sequence for which the percent identity as calculated above is about 
equal to or greater than a specified minimum Percent Identity then the 
Compared Sequence has the specified minimum percent identity to the 
5 Reference Sequence even though alignments may exist in which the 
hereinabove calculated Percent Identity is less than the specified Percent 
Identity. 



As used herein, the terms "portion," "segment," and "fragment, " when 
10 used in relation to polypeptides, refer to a continuous sequence of nucleotide 
residues, sequence forms a subset of a larger sequence. Such terms include 
the products produced by treatment of said polynucleotides with any of the 
common endonucleases, or any stretch of polynucleotides that could be 
synthetically synthesized. These may include exonic and intronic sequences of 
1 5 the corresponding genes. 



Therapeutic target biological activity" (or just "biological activity") as 
used herein is a very broad term that relates to all the directly or indirectly 
measurable and identifiable biological activities of the therapeutic target gene 
20 and protein, including but not limited to those examples set out in (a) - (d) 
below: 



(a) Relating to the purified therapeutic target protein, therapeutic target 
biological activity includes, but is not limited to, all those biological processes, 
25 interactions, or binding of ligands, proteins, membrane components or other 
compounds (such as small organic compounds), binding behavior, binding- 
activity relationships, pKa, pD, enzyme kinetics, stability, and functional 
assessments of the protein. 



(b) Relating to therapeutic target biological activity in cell fractions, 
reconstituted cell fractions or whole cells, these activities include, but are not 
limited to the ligand or antibody binding behavior and all measurable 
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consequenceHthis effect, such as measurement of^^ signaling cascade, 
efflux, influx or accumulation, stability or degradation of cellular components, 
membrane composition and behavior, cell growth, development or behavior 
and other direct or indirect effects of therapeutic target activity. 

5 

(c) Relating to therapeutic target genes and transcription, therapeutic 
target biological activity includes the rate, scale or scope of transcription of 
DNA to generate therapeutic target mRNA or its alternate transcripts; the 
effect of regulatory proteins on such transcription, the effect of modulators of 
10 such regulatory proteins on such transcription; plus the stability and behavior 
of mRNA transcripts, post-transcription processing, mRNA amounts and 
turnover, and all measurements of translation of the mRNA into polypeptide 
sequences. 



15 ( d ) Relating to therapeutic target biological activity in organisms, this 

includes but is not limited to biological activities which are identified by their 
absence or deficiency in disease processes or disorders caused by aberrant 
therapeutic target biological activity in those organisms. Broadly speaking, 
therapeutic target biological activity can be determined by all these and other 

20 means for analyzing biological properties of proteins and genes that are 
known in the art. 



25 



DETAILED DESCRIPTION OF THE INVENTION 



In accordance with the present invention, a gene and protein 
associated with maintenance of HDL levels in humans has been determined 
30 by identification of mutant forms of the gene in families having low HDL levels 
^ using the positional cloning strategy set out in Example 1 . 
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The jj^gree of family BC-11M is shown in 1 . The phenotype in 

the family used to define the linkage segregates a low HDL 
(hypoalphalipoproteinemia) trait in an autosomal dominant fashion with high 
penetrance. Other lipid parameters are normal in this family (including 
5 triglyceride (TG) and low density lipoprotein (LDL)) and there is an absence of 
other confounders (such as obesity and diabetes). There is strong evidence 
for cardiovascular disease and coronary artery disease in particular in this 
family as multiple members have had cardiac events (myocardial infarct (Ml), 
stroke and angina) and have been treated for symptoms of this disease (such 
1 0 as with coronary artery bypass graft (CABG)). One member of this family has 
Behcet syndrome (related to lupus and auto-immune disorders; another 
member has celiac disease (gluten intolerance) and others have thyroid 
problems. 

15 Members of this family had the wild type sequence for coding regions 

of ABCA1 , thus distinguishing them from the previously identified low-HDL 
causing mutations in ABCA1. In addition, the ABCA1 gene is excluded based 
on the genetics of this family, and based on normal cholesterol efflux 
measurements in cultured fibroblasts from affected individuals in this family. 



20 
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The pedigree of family NL-003 is shown in Figure 2. The proband 
(ll:35) has an HDL level below the fifth percentile (age and sex corrected). 
There are a few cases of overweight/obesity (BMI>25>30) but this condition 
does not fully co-segregate with the HDL condition and so may not be related. 



The pedigree of family NL-120 is shown in Figure 3. Again the proband 
(11:01) has an HDL level below the fifth percentile (age and sex corrected). In 
both NL-003 and NL-120 multiple individuals with low HDL are observed. In 
family NL-003 individuals with low HDL and cardiovascular disease and 
30 cardiac events (Ml) are observed. 
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Thus.^J^genes identified according to the in^^ion are located on 
chromosome 9, at or near the 9q22 band. Markers for this locus have been 
identified and are as follows: 9q22ca7 (SEQ ID NO: 1) and 9q22ca17 (SEQ 
ID NO: 2). Other markers are available in the public domain. 



Description of Gene95 and its mutations 



Gene95 is alternately known and described herein as P1G95, gene95, 
10 G95 or as HDL Regulatory Protein. Aspects of Gene95 are described at a 
variety of different sources in public databases. 

Goldenpath Gene ID: ENSG000001 36925, including transcript ID: 
ENST00000259452. (GoldenPath assembly of the Human Genome Project 
15 Working Draft and found at www.genome.ucsc.edu or at the Ensembl web- 
site www.ensembl.org, version effective August 2001). 



Gen Bank Accession Numbers: AK056453, XM_088573 (Available at 
www.ncbi.nlm.nih.gov/entrez/query.fcqi). Those skilled in the art will recognize 
20 that these references do not provide the correct full length nucleotide coding 
sequences for gene95, nor do they provide the correct amino acid sequence 
of gene95 when translated. 



The full length cDNA sequence for gene95 (P1G95) is set out in SEQ 
25 ID NO: 3 comprising a 4333 bp transcript including 5'UTR and 3' tail. The 
exon/intron structure of gene95 is set out in Figure 4. The gene comprises 10 
exons with possible alternate transcripts for exon 6 ("exon 6 alt" or E6a) and 
exon 9 ("exon 9 alt" or E9a). Figure 4 illustrates the expected 10 exon 
construct and other possible transcripts employing E6a or E9a. 

30 

The gene95 protein comprises a 516 amino acid protein (SEQ ID NO: 
4). Use of an alternate methionine start site may generate a sequence of 489 



11 



WO 2004/003159 PCT/US2003/020410 

amino acid^^nalysis of the protein sequence of g4P^5 suggests it has the 
characteristics of a soluble protein which is involved in maintaining serum lipid 
homeostasis in humans. The protein is believed to contain one or more of the 
following features: N-glycosylation site (PS00001); Glycosaminoglycan 
5 attachment site (PS00002); 4 cAMP- and cGMP-dependent protein kinase 
phosphorylation sites (PS00004); 12 Protein kinase C phosphorylation sites 
(PS00005); 8 Casein kinase II phosphorylation sites (PS00006); 8 N- 
myristoylation sites (PS00008); Zinc finger C2H2 type domain (PS50157); 
Cysteine-rich region (PS50311). 

10 

For SEQ ID NO: 3, residues 1-400 represent the 5'-UTR 
(untranslated region) while the 3'UTR comprises residues 1952-end. For 
the mutant sequence of SEQ ID NO: 6, showing the T1 802G mutation, 
the 5'-UTR region is shown as residues 1-400 and the 3'-UTR region as 
15 residues 1952-end. For the mutant sequence of SEQ ID NO: 8, showing 
the 1706C1707 mutation, the 5'-UTR region is shown as residues 1-400 
and the 3'-UTR region as residues 1952-end. 



Gene95 has a Rhodanese-like domain most closely related to group of 
20 hypothetical, uncharacterized bacterial proteins. An examplary rhodanese-like 
domain is found at amino acids 304 - 406. There is no similarity of this gene 
to any other human gene, other than hypothetical or "rhodanese-like" or 
"sulfurtransferase-related" proteins. Rhodanese domains are found in a 
thiosulfate sulfurtransferase (e.g. Rhodanese; IPR001307). A public database 
25 search identified 21 hits in human genome with Rhodanese domains, 1 being 
a platelet adhesion molecule, the others being phosphatases. Other domain 
signatures such as a DnaJ central domain (e.g. [CXXCXGXGJ4; amino acids 
422-480), XS Zinc finger domain (e.g. amino acids 458-465) and integrase 
site (e.g. amino acids 452- 465) are also suggested. 

30 

Figure 5 sets out the nucleotide and amino acid sequences of 
individual exons, and some sequence from the promoter (genomic) and intron 
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regions (espel 




splice site junctions). The Gene95 




uence spans about 



34kb of genomic DNA. Introns which are not fully set out may be obtained 
from the public human genome sequence database Goldenpath. Lower case 
letters indicate intron and promoter regions, while upper case letters indicate 
5 nucleotides in the cDNA or mRNA or amino acids, as dictated by the context. 
Exon 1 and Exon 2 have one or more asterisks ("*"). These are putative stop 
codons. The true initiator methionine is believed to be the first methionine in 
Exon 2 following the asterisk. Asterisks are also found in exon 10. The first 
asterisk is the expected termination site, generating a polypeptide of 516 
10 amino acids. The remaining amino acid translations are illustrated for the sake 
of completeness and are not believed to lead to a functioning protein. Exon 6a 
and Exon 9a contain termination codons. If these alternate exons are used, 
they are believed to result in the truncated proteins, as indicated. 

15 Homology: Among full length proteins, the closest homologs to human 

Gene95 are the shared 29 to 34 % identity of Rhodanese-related 
sulfurtransferases (Salmonella typhimurium and Brucella melitensis) and 
dihydrofolate reductase (Schizosaccharomyces pombe). Hypothetical 
bacterial proteins of highest homology include: Q9RVC9J3EIRA/1 19-216; 

20 Q55613_SYNY3/1 13-210; Q9Z7H1_CHLPN; 084632_CHLPN/1 13-212; 
084632_CHLTR/1 13-212; 0341 31J-ACLA/1 16-21 3; 031 457_BACSU/1 16-213 
(YBFQ protein). 



25 ID NO: 5), the first non-human mammalian homolog of gene95. The sequence 
shares approximately 76% sequence identity across the full length of the 
protein with human gene95. Human and mouse amino acid sequences are 
aligned in Figure 6. Shared amino acids are indicated, along with conservative 
and non-conservative changes. 



This invention also discloses and claims mouse gene95 protein (SEQ 



30 
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Add^^l investigations of short nucleic acid^J^uence segments from 
other mammals indicate that at the nucleotide level, mammalian species 
share a generally conserved gene95 sequence. Specifically: 



5 Human:Cow 514/565 (90%) identical based on overlapping sequence 
Human:Rat 338/401 (84%) identical based on overlapping sequence 
HumanrPig 228/265 (86%), identical based on overlapping sequence 
Human:Mouse 266/307 (86%) identical based on overlapping sequence 



10 Expression pattern: Based on EST evidence, the gene is shown to be 

expressed in the following tissues and cell lines: teratocarcinoma, brain, 
placenta, ovarian tumor, lymphoma, melanoma, neuroblastoma. The data of 
Examples 2 and 3 confirm expression in a wide range of tissues. 

15 Mutation 1, found in the family BC-11M, was identified as a 1802T>G 

(SEQ ID NO: 6) causing a Ser468Ala mutation SEQ ID NO: 7. The mutation 
was present in 13 affected family members and not present in 10 unaffected 
family members. The mutation was not found in 198 control chromosomes, 
nor in 164 chromosomes from individual with high HDL. Both control cohorts 

20 were of similar ethnicity to the BC-1 1 M family. The mutation may result in loss 
of a putative Protein Kinase A phosphorylation site (PKA- basic-basic-neutral- 
Ser). This mutation alters a serine residue that is conserved in the mouse 
protein. 



25 Mutation 2, from an individual in the NL-003 family with HDL level <5 th 

percentile comprises a 1706C1707 insertion SEQ ID NO: 8. The insert is in a 
second base of codon for Q436 resulting in Q436P and 39 additional unique 
amino acids producing a premature stop codon, 475aa total (SEQ ID NO: 9 
and counting from the methionine at residue 14). Mutation 2 is also observed 

30 in an individual from family NL-120 with HDL <5 th percentile. Mutation 2 was 
not observed in 228 control chromosomes of similar ethnicity. 
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Alignmelrc of the amino acid sequence of mutation 1 and mutation 2 
with the wild-type amino acid sequence of human gene95 is set out in Figure 

7. 



gene95 and other polymorphisms have been identified and are set out in 
Table 1. These cSNPs result in the amino acid changes identified, if any. 
Other cSNPs and polymorphisms in exonic or intronic regions of gene95 can 
also be identified by those skilled in the art. 



Because of the processing that may take place in transforming the 
initial RNA transcript into the final mRNA, the nucleotide sequences disclosed 
herein may represent less than the full genomic sequence. Such genes and 
cDNA sequences are still considered corresponding sequences because they 

15 both encode similar RNA sequences. Thus, by way of non-limiting example 
only, a gene that encodes an RNA transcript, which is then processed into a 
shorter mRNA, is deemed to encode both such RNAs and therefore encodes 
an RNA complementary to (using the usual Watson-Crick complementarity 
rules), or that would otherwise be encoded by, a cDNA (for example, a 

20 sequence as disclosed herein). Thus, the sequences disclosed herein 
correspond to genes contained in the cells and are used to determine relative 
levels of expression because they represent the same sequences or are 
complementary to RNAs encoded by these genes. Such genes also include 
different alleles and splice variants that may occur in the cells used in the 

25 processes of the invention. 

Thus, the polynucleotides, such as the gene sequences disclosed 
herein, for use in the screening assays of the invention "correspond to" 
gene95 if the polynucleotide encodes an RNA (processed or unprocessed, 
30 including naturally occurring splice variants and alleles) that is at least 50% 
identical, preferably at least 70%, 80%, 90% identical, and more preferably at 
least 95%, or 98% identical to, and especially having the sequence of, an 



5 



Single Nucleotide Polymorphisms of the coding region (cSNPs) of 



10 



15 



WO 2004/003159 



PCT/US2003/020410 



RNA that 



Id be encoded by, or be complei 



:ary to, such as by 



hybridization with, a polynucleotide having SEQ ID NO: 3. In addition, genes 
including sequences at least 50% identical to SEQ ID NO: 3, preferably at 
least about 70%, 80%, 90% identical to such a sequence, more preferably at 

5 least about 98% identical to such sequence and most preferably comprising 
such sequences as are specifically contemplated by all of the processes of 
the present invention as being genes that correspond to these sequences. In 
addition, sequences encoding the same polypeptides and proteins as any of 
these sequences, regardless of the percent identity of such sequences, are 

0 also specifically contemplated by any of the methods of the present invention 
that rely on any or all of said sequences, regardless of how they are otherwise 
described or limited. Thus, any such sequences are available for use in 
carrying out any of the methods disclosed according to the invention. Such 
sequences also include any open reading frames, as defined herein, present 

5 within gene95. Such sequences include the mutant gene95 sequences of 
SEQ ID NO: 6 and SEQ ID NO: 8, and sequences containing the alternative 
exons shown in Figure 5. 

In accordance with the forgoing, the present invention encompasses an 
0 isolated polynucleotide comprising a polynucleotide sequence or the full 
complement of this polynucleotide sequence, wherein the polynucleotide 
sequence is at least 65%, preferably at least 75%, more preferably at least 
85%, especially at least 90%, most preferably 95%, or even 98%, identical to 
SEQ ID NO: 3, with the sequence of SEQ ID NO: 3 especially preferred. The 
5 invention also includes an isolated polynucleotide comprising a polynucleotide 
sequence that encodes a polypeptide having the amino acid sequence set 
forth in SEQ ID NO: 4. 

Relating to the gene95 protein (SEQ ID NO: 4), the invention includes 
0 all those proteins, polypeptides and amino acid sequences corresponding to 
SEQ ID NO: 4. Such sequences corresponding to SEQ ID NO: 4 include any 
full length protein or fragment thereof which retains a biological activity of 
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gene95 protJBfeuch sequences are preferably at lei^^bout 50% identical 
to SEQ ID NO: 4, more preferably about 60%, 70%, 80% and most preferably 
at least about 90%, or 95%, identical to SEQ ID NO: 4. Such sequences 
include any ortholog or homolog of gene95 protein from any organism, 
5 preferably a eukaryote, more preferably a mammal, most preferably a rodent 
or human; and any amino acid sequence having at least about 50%, 60%, 
70%, 80% and most preferably at least about 90% sequence identity thereto. 
Such sequences include mutant sequences SEQ ID NO: 7 and SEQ ID NO: 
9, sequences corresponding to alternative open reading frames of gene95 
10 nucleotide sequences, and sequences found in the alternatively spliced exons 
in Figure 5. 



In accordance with the foregoing, the present invention includes an 
isolated polypeptide comprising an amino acid sequence having at least 65%, 
1 5 preferably at least 75%, more preferably at least 85%, especially at least 90%, 
most preferably 95%, or even 98%, identity with the amino acid sequence set 
forth in SEQ ID NO: 4, wherein a polypeptide comprising the sequence of 
SEQ ID NO: 4 is especially preferred. 

20 The present invention also relates to nucleic acid vector comprising 

such an isolated polynucleotide and a recombinant host cell comprising such 
a vector. The present invention also encompasses a method for producing the 
polypeptide of SEQ ID NO: 4 comprising culturing the host cell of claim 49 
under conditions supporting production of the polypeptide, methods of which 

25 are all well known to those skilled in the art in light of the disclosure set forth 
herein. 



Screening Assays using Gene95 

30 

The present invention provides screening assays using gene95 gene 
and/or protein for use in identifying therapeutic agents, especially agents 
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antagonisti^^low HDL levels. One such protocol ^Bfcves the screening of 
chemical agents for ability to modulate, either upward or downward, the 
activity of gene95 so as to elevate HDL activity, such as by increasing HDL 
levels, especially plasma levels, in an animal, including humans. Agents that 
5 alter the activity of the gene95 gene, or active fragments or portions of it, or 
that may modulate the activity of polypeptides encoded by gene95, or 
polypeptides that act as transcription factors to modulate the activity of such 
genes, or other related gene segments, such as enhancers or other 
regulatory genetic elements that modulate the activity of gene95, acting either 

10 in cis or trans fashion, are thereby identified and may prove useful in 
preventing, treating, or otherwise ameliorating, low HDL levels, or the effects 
of low HDL levels or diseases related to low HDL levels, such as by 
ameliorating reverse cholesterol transport or other lipid related processes, 
and/or for diseases such as cardiovascular disease, atherosclerosis, and 

15 other diseases. 



In accordance with the invention disclosed herein, there are provided 
compositions and methods for treating patients having low HDL-C, vascular 
disease, dyslipidemia, atherosclerosis or a related disorder, or for 
20 ameliorating reverse cholesterol transport in the body, by administering 
compounds that modulate biological activity or expression of gene95. 

The present invention specifically contemplates screening assays, 
such as where a large number of compounds are to be screened for activity in 
25 modulating therapeutic target biological activity. As to all such assays as 
disclosed herein, such modulation may include either an increase or a 
decrease in therapeutic target biological activity. 



Those skilled in the art are able to identify measurable biological 
30 activities of a therapeutic target which can be usefully incorporated into low or 
high throughput screening assays. Non-limiting examples of such assays are 
described herein for illustration purposes. Based on these teachings, other 
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embodiments^lherapeutic target screening assays wfce directly reduced 
to practice. 

The invention also provides assays for the identification of such 
5 therapeutic compounds and their analogs. Compounds that modulate the 
biological activity of gene95 gene or protein are considered useful in the 
invention. Exemplary screening assays (in vitro or in vivo) and computational 
assays (in silico) for the identification of such compounds are detailed below. 
The screening assays of the invention simplify the evaluation, identification 
0 and development of therapeutic agents for the treatment and prevention of 
low HDL and/or vascular diseases, dyslipidemias or related disorders. In 
general, the screening methods provide a facile means for selecting natural 
product extracts or synthetic compounds of interest from a large population 
(i.e. a chemical library) which are further evaluated and condensed to a few 
5 active core structures. Multiple analogs of such core structures may be 
developed and tested to identify those preferred analogs which have 
improved characteristics as therapeutic agents. 

The present invention also relates to a method for identifying an agent 
that modulates the activity of a polypeptide whose activity alters high density 
lipoprotein (HDL)-activity, comprising: 

a) contacting a candidate compound with a polypeptide encoded by a 
polynucleotide corresponding to gene95 under conditions facilitating such 
activity; and 

b) determining a change in the biological activity of said polypeptide as 
a result of said contacting; 

wherein said change in activity identifies the test compound as an 
agent that modulates said polypeptide biological activity. 

In a one embodiment of this process, the change in biological activity in 
step (b) is a decrease or increase in activity of the polypeptide. In a preferred 
embodiment, the activity is measured by measuring the activity of an enzyme, 
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such as wh J^he polypeptide itself has enzyme ac^B^ that can be directly 
or indirectly assayed. Such assays may be conducted in vitro or in vivo. 

In a preferred embodiment, said polypeptide has the amino acid 
5 sequence of SEQ ID NO: 4; in an alternative embodiment said polynucleotide 
has SEQ ID NO: 3. 



In another embodiment, the polypeptide is present in a cell extract, a 
liposome or an intact cell. The present invention specifically contemplates 

10 embodiments in which the cell is a recombinant cell line containing a 
heterologous gene95 construct. In other embodiments, said cell or extract is 
engineered by other than genetic engineering, such as where the activity of a 
polypeptide is to be enhanced and the cell has been engineered to contain, or 
have on its surface, said polypeptide but wherein the polypeptide is present 

1 5 due to physical insertion of the polypeptide into the membrane or cytoplasm of 
the cell and not through expression of a gene contained in the cell. 



The present invention also relates to a method for identifying an agent 
that modulates a Gene95 activity, comprising: 
20 a) contacting a test compound with a polypeptide encoded by a 

polynucleotide corresponding to gene95 under conditions supporting an 
activity of said polypeptide; and 

b) determining a change in the activity of the polypeptide as a result of 
said contacting; 

25 wherein said change in activity identifies the test compound as an 

agent that modulates a Gene95 activity. 



Such determined change in activity in step (b) may be a decrease or an 
increase in activity. In one embodiment, the activity is measured by measuring 
30 the activity of an enzyme. In other embodiments of such method, the 
polypeptide is present in a lipid bilayer, including where this lipid bilayer is part 
of a liposome. In addition, the polypeptide may be part of an intact cell, such 
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as where th ct cell is a cell that has been engin^fed to comprise said 
polypeptide, such as where the intact cell is a recombinant cell that has been 
genetically engineered to express said polypeptide, preferably where the cell 
does not express said polypeptide absent said engineering. In all cases, the 
5 intact cell is preferably a mammalian cell. 



The present invention also relates to a method for identifying an HDL- 
enhancing agent, comprising administering to an animal an effective amount 
of an agent found to have modulating activity using any of the assays 
10 disclosed herein and detecting an increase in plasma HDL activity in said 
animal due to said administering thereby identifying an agent useful in 
enhancing HDL activity. 

In a preferred embodiment, said animal exhibits low HDL activity prior 
15 to administering said agent. In another preferred embodiment, the HDL- 
enhancing activity is an increase in HDL level in said animal, most preferably 
an increase in plasma HDL level and preferably where said animal is a human 
patient. 

20 The present invention additionally relates to a method for treating a 

low-HDL related disorder in an animal afflicted with said disorder comprising 
administering to said animal an effective amount of an agent found to have 
HDL-enhancing activity using any of the assays of the invention. In a 
preferred embodiment, the animal to be treated is a human patient. In an 

25 additional preferred embodiment, the disorder to be treated is selected from 
the group consisting of low HDL diseases, vascular diseases and 
dyslipidemias. 



The agents identified according to the processes of the invention are 
30 also useful for treating or preventing coronary artery disease, regardless of 
the HDL status of the patient For example, a patient with normal HDL levels 
who has a family history of coronary artery disease would still be advised to 
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take a thei 




:ic agent according 



to the inventioi 



fder to elevate HDL 



levels and further reduce the risk of coronary artery disease. Thus, the patient 
does not need to have a dyslipidemia in order to be eligible for treatment 
according to the invention. 



In general, the screening methods of the invention involve screening 
any number of compounds for therapeutically active agents by employing any 
number of in vitro or in vivo experimental systems. Exemplary methods useful 
for the identification of such compounds are detailed herein. 



The methods of the invention simplify the evaluation, identification and 
development of active agents for the treatment and prevention of low HDL 
levels as well as cardiovascular disease (CVD) and dyslipidemias. In general, 
the screening methods provide a ready means for selecting natural product 

15 extracts or compounds of interest from a large population which are further 
evaluated and condensed to a few active and selective materials. Positive 
candidates from this pool are then purified and evaluated in the methods of 
the invention to determine their HDL-raising, anti-CVD or anti-dyslipidemia 
activities or both. The positive candidates may be used directly as therapeutic 

20 agents, or they may provide informative structures for structure-activity 
relationship (SAR) analysis and development of further analogs which then 
become the preferred therapeutic agents. 

25 Assessment of Gene 95 as a sulfurtransferase responsible for HDL 
regulation. 

While the human genetics data establishes that the function of Gene 
95 is to maintain HDL levels in humans, the specific mechanism of action of 
30 Gene 95 remains to be established. Without being bound to any theory for a 
mechanism of action, the inventors suggest that Gene 95 may function as a 
sulfurtransferase, or similar type enzyme. This is based on the comparison of 
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the Rhodane^Womain found in the protein. Rhodar^e domains may be 
classified as active for sulfurtransferase activity, active for phosphatase 
activity or catalytically inactive. Human and mouse G95 contain consensus 
for sulfurtransferase within the rhodanese domain. Possible consensus for 
5 phosphatase within rhodanese domain is found in the human but not 
conserved in mouse G95. In addition, one other human gene with this 
consensus has been annotated as a sulfurtransferase activity. 



10 For example, thioredoxin binding protein Txnip is known to be involved in lipid 
metabolism based on the hyperlipidemic mouse phenotype. 

It is understood that screening assays employing a gene 95 
sulfurtransferase biological activity may be designed using a known substrate 

15 for sulfurtransferases, such as cyanide, thioredoxin and dihydrolipoate. 
Alternatively, screening assays may be designed based on previously 
disclosed sulfurtransferase assays. In one colorimetric assay for thiocyanate 
production, the assay measures conversion of thiosulfate and cyanide to 
sulfite and thiocyanate. In an assay for NADPH oxidation by thioredoxin 

20 reductase, the assay measures conversion of thiosulfate and thioredoxin to 
sulfite and thioredoxinpersulfide. Compounds which agonize or antagonize 
these conversions via Gene 95 interaction are modulators of the invention. 



25 encoded by one or more of the polynucleotides disclosed herein as 
corresponding to gene95, including candidates with high homology thereto, 
such as polynucleotides at least 90% or 95% identical to such genes or 
polypeptides with high homology to the polypeptides encoded by said genes. 
Such assays may employ drug screening technology such as (but not limited 

30 to) the ability of various dyes to change color in response to changes in assay 
conditions resulting from the activity of the polypeptides. 



Several types of evidence connect these activities to HDL metabolism. 



Functional assays may be based on the activity of a polypeptide 
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Drug s^fcning assays can also be based up^fee ability of gene95 
polypeptide to interact with other proteins. Such interacting proteins can be 
identified by a variety of methods known in the art, including, for example, 
radioimmunoprecipitation, co-immunoprecipitation, co-purification, and yeast 
5 two-hybrid screening. The ability of test compounds to agonize or antagonize 
protein-protein interactions is a standard type of screening assay that may be 
employed. Such interactions can be assayed by means including but not 
limited to fluorescence polarization or scintillation proximity methods. Drug 
screens can be based upon a function or feature apparent upon creation of a 

10 transgenic or knockout mouse, or upon overexpression of the protein or 
protein fragment in mammalian cells in vitro. Moreover, expression of 
mammalian (e.g., human) gene95 polypeptides in yeast or C. elegans allows 
for screening of candidate compounds in wild-type and mutant backgrounds, 
as well as screens for mutations that enhance or suppress a low HDL 

15 phenotype. Modifier screens can also be performed in transgenic or knock-out 
mice. 



It will be evident to those skilled in the art that the disclosure herein 
20 facilitates incorporation of Gene95 into standard screening assays for 
identifying compounds that interact with and/or modulate gene95. Such 
assays include protein/compound binding assays, gene expression assays 
and many other types. Along with standard assays, known features of 
Gene95 suggest more specialized assays that may be employed. For 
25 example, the putative phosphorylation sites suggest that an assay which 
measures a test compound's propensity to influence the rate, amount or 
timing of phosphorylation of gene95 would be a useful modulator of the 
invention, and a potential therapeutic agent or analog thereof for the diseases 
indicated herein. Features of gene95 which can be exploited for use in 
30 screening include the N-glycosylation site, the Glycosaminoglycan attachment 
site; the 4 cAMP- and cGMP-dependent protein kinase phosphorylation sites; 
the 12 Protein kinase C phosphorylation sites; the 8 Casein kinase II 
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phosphorylatiWFsites; the 8 N-myristoylation sites; the^nc finger C2H2 type 
domain; and the Cysteine-rich region. Other features that may be useful for 
measurement, include the activity of the Rhodanese-like domain, the DnaJ 
central domain, the XS Zinc finger domain and the integrase site. 

5 

In accordance with the foregoing, the present invention relates to a 
method for identifying an agent that modulates the activity of a polynucleotide 
whose expression alters high density lipoprotein (HDL)-activity in vivo, 
comprising: 

10 a) contacting a test compound with a polynucleotide corresponding to 

gene95 under conditions facilitating expression of the polynucleotide; and 

b) determining a change in the expression of the polynucleotide as a 
result of said contacting; 

wherein said change in expression identifies the test compound as an 
15 agent that modulates the activity of a polynucleotide whose expression alters 
high density lipoprotein (HDL)-activity. 

In specific embodiments of such a process, the change in expression in 
step (b) may be a decrease or increase in expression of said polynucleotide 
20 or gene. 

In another aspect, the present invention relates to a method for 
identifying an agent that modulates Gene95 activity, comprising: 

a) contacting a test compound with a genetic construct comprising a 
25 reporter gene operably linked to a Gene95 promoter under conditions 

supporting transcription of said reporter gene; 

b) determining a change in transcription of the reporter gene as a result 
of said contacting 

wherein a change in said transcription indicates that the test compound 
30 is an agent that modulates Gene95 activity. 
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Sucn^^termined changes may be an inc or decrease in the 
recited transcription and wherein transcription is determined by measuring the 
amount of an expression product encoded by said reporter gene, such 
expression product including either an RNA or a polypeptide. In preferred 
5 embodiments, the reporter gene may be present in a liposome or in an intact 
cells, such as a mammalian cell. Preferred embodiments of such methods 
include use of a mammalian Gene95 promoter, such as a human promoter, 
preferably that shown in SEQ ID NO: 15, or a mouse promoter, preferably the 
promoter sequence in SEQ ID NO: 14. In all cases, the reporter gene is 
1 0 commonly a gene whose expression is easily measured and will be other than 
Gene95 itself, although the latter is not excluded for use as a reporter gene. 



Such a process is especially useful for identifying agents effective 
against low HDL levels and disorders, and symptoms thereof. Such agents or 

15 their chemical analogs are also useful for modulating gene95 gene or protein 
and/or for treatment or prophylaxis of cardiovascular disease. An example of 
a construct comprising a human Gene95 promoter sequence is shown in SEQ 
ID NO: 15, wherein the promoter sequence (about the first 1500 bases) is at 
the start of the sequence and is followed by exon 1 (starting at residue 1502) 

20 after which is intron 1 with exon 2 at the end of the sequence, while a 
sequence that incorporates a mouse promoter (about 1500 nucleotide 
residues) is shown in SEQ ID NO: 14, with the promoter at the beginning of 
the sequence, then followed by exonl (starting at residue 1501), followed by 
intron 1 with exon 2 at the end of the sequence. The exonic sequences were 

25 found in mRNA. The present invention encompasses such exon and promoter 
sequences. 



In another aspect, the invention provides a method for computationally 
identifying a compound for modulating HDL levels and/or treating a vascular 
30 disease or dyslipidemia. The method involves (a) determining the active site 
of the gene95 protein (i.e. through X-Ray crystallography or other techniques); 
and (b) through computational modeling, identifying a compound which 
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interacts with m^active site, thereby identifying a compound, or its analog, as 
a compound which is useful for modulating HDL levels and/or treating a 
vascular disease, dyslipidemia or a related disorder. 



5 The present invention further relates to a method for identifying a 

therapeutic agent, comprising: 

a) contacting a chemical agent with a polynucleotide corresponding to 
gene95 under conditions supporting expression of said polynucleotide; 

b) determining a change in the expression of said polynucleotide as a 
1 0 result of said contacting; 

wherein a change in said expression identifies a therapeutic agent. 



In a further embodiment, the present invention encompasses a method 
for identifying a therapeutic agent that modulates the activity of a polypeptide 
1 5 that affects high density lipoprotein (HDL)-activity in vivo, comprising: 

a) contacting a test compound with a polypeptide encoded by a 
polynucleotide corresponding to gene95 under conditions supporting an 
activity of said polypeptide; and 

b) determining a change in the activity of said polypeptide as a result 
20 of said contacting; 

wherein a change in said activity identifies the test compound as a 
therapeutic agent that modulates the activity of a polypeptide that affects HDL 
activity. 



25 In a preferred embodiment thereof, the polypeptide corresponds to 

SEQ ID NO: 4 and/or the polynucleotide corresponds to SEQ ID NO: 3 



Gene95 polypeptide, preferably of mammals, most preferably of rodent 
or human, can be used as an antigen to raise antibodies, including 
30 monoclonal antibodies. Such antibodies will be useful for a wide variety of 
purposes, including but not limited to therapeutic uses for modulating gene95 
biological activity, functional studies and the development of drug screening 
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agents (e.g., small 

organic compounds) on the expression or biological activity of gene95 
polypeptide identified according to the invention can be applied not only in 
basic drug screening, but also in clinical trials. For example, the effectiveness 

> of an agent determined by a screening assay as described herein to increase 
or decrease gene expression, protein levels, or biological activity can be 
monitored in clinical trails of subjects exhibiting low HDL levels due to 
inadequate gene expression, protein levels, or biological activity. Alternatively, 
the effectiveness of an agent determined by a screening assay to modulate 

) expression of gene95, as well as structurally and functionally related genes, 
including genes with high homology thereto, and including protein levels, or 
biological activity can be monitored in clinical trials of subjects exhibiting 
decreased altered gene expression, protein levels, or biological activity. In 
such clinical trials, the expression or activity of the genes or polypeptides 

5 disclosed herein and, preferably, other genes that have been implicated in, for 
example, cardiovascular disease can be used to ascertain the effectiveness of 
a particular drug. 



For example, and not by way of limitation, genes that are modulated in 
cells by treatment with an agent (e.g., compound, drug or small molecule) that 
modulates the activity of gene95, or any expression products thereof, or 
polypeptides that modulate that activity of any of such genes (e.g., identified 
in a screening assay as described herein) can be identified. Thus, to study the 
effect of agents on cholesterol levels or cardiovascular disease, for example, 
in a clinical trial, cells can be isolated and RNA prepared and analyzed for the 
levels of expression of gene95 and other genes implicated in similar or related 
disorders. The levels of gene expression can be quantified by Northern blot 
analysis or RT-PCR, or, alternatively, by measuring the amount of protein 
produced, by one of a number of methods known in the art, or by measuring 
the levels of biological activity of polypeptides encoded thereby or other 
genes. In this way, the gene expression can serve as a marker, indicative of 
the physiological response of the cells to the agent Accordingly, this 
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response stalWnay be determined before, and at Various points during, 
treatment of the individual with the agent. 



In a preferred embodiment, the present invention provides a method for 
5 monitoring the effectiveness of treatment of a subject with an agent (e.g., an 
agonist, antagonist, peptidomimetic, protein, peptide, anti-gene95 antibody, 
antisense molecule, nucleic acid, small molecule, or other drug candidate 
identified by the screening assays described herein) including the steps of (i) 
obtaining a pre-administration sample from a subject prior to administration of 

10 the agent; (ii) detecting the level of HDL activity or level of expression of 
gene95 protein, mRNA, or genomic DNA in the pre-administration sample; (iii) 
obtaining one or more post-administration samples from the subject; (iv) 
detecting the level of expression or activity of gene95 protein, mRNA, or 
genomic DNA or HDL in the post-administration samples; (v) comparing the 

15 level of expression or activity of said protein, mRNA, or genomic DNA in the 
pre-administration sample with that of the corresponding post administration 
sample or samples; and (vi) altering the administration of the agent to the 
subject accordingly. For example, increased administration of the agent may 
be desirable to increase the expression or activity of gene95 or its encoded 

20 polypeptide, or to increase HDL-activity, such as plasma HDL-level, to higher 
levels than detected, i.e., to increase the effectiveness of the agent. 
Alternatively, decreased administration of the agent may be desirable to 
decrease expression or activity of the polypeptide or gene where excess 
expression contributes to lowering of HDL activity or levels. 

25 

The gene95 genes disclosed herein as being involved in HDL 
regulation, such as inducing low HDL levels in an animal, can be used, or a 
fragment thereof can be used, as a tool to express a protein, where such 
genes encode a protein, in an appropriate cell in vitro or in vivo (gene 
30 therapy), or can be cloned into expression vectors which can be used to 
produce large enough amounts of protein to use in in vitro assays for drug 
screening. Expression systems which may be employed include baculovirus, 
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herpes viru^fedenovirus, adeno-associated virusjJlcterial systems, and 
eucaryotic systems such as CHO cells. Naked DNA and DNA-liposome 
complexes can also be used. 

5 Assays of such activity include binding to intracellular interacting 

proteins, interaction with a protein that up-regulates gene or polypeptide 
activity, interaction with HDL particles or constituents, interaction with other 
proteins which facilitate interaction with HDL or its constituents, and 
measurement of cholesterol efflux. Furthermore, assays may be based upon 

10 the molecular dynamics of macromolecules, metabolites and ions by means 
of fluorescent-protein biosensors. Alternatively, the effect of candidate 
modulators on expression or activity may be measured at the level of protein 
production using the same general approach in combination with standard 
immunological detection techniques, such as Western blotting or 

15 immunoprecipitation with a specific antibody. Again, useful cholesterol- 
regulating or anti-CVD therapeutic modulators are identified as those which 
produce an change in activity that correlates with increased HDL levels, 
especially in the plasma of a test subject. 

20 Candidate modulators (i.e., test compounds) may be purified (or 

substantially purified) molecules or may be one component of a mixture of 
compounds (e.g., a combinatorial library or an extract or supernatant obtained 
from cells). In a mixed compound assay, gene expression is tested against 
progressively smaller subsets of the candidate compound pool (e.g., 

25 produced by standard purification techniques, e.g., HPLC or FPLC; Ausubel 
et al.) until a single compound or minimal compound mixture is demonstrated 
to modulate gene or protein activity or expression in a manner having a 
beneficial effect on HDL levels. 



30 Specific compounds which will modulate the gene expression or gene 

transcript levels in a cell of gene95 include antisense nucleic acids, ribozymes 
and other nucleic acid compositions and sequence specific binding 
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compounds v^Bh specifically hybridize with said gen^(including exons or 
introns of such genes) or any RNA transcript of gene95. These specific 
compounds are compounds of the invention, and are useful for treating the 
diseases discussed previously. Design and manufacturing of such 
5 compounds are easily achieved by those skilled in the art based on the 
disclosures of the instant specification. 



Specific compounds which modulate the activity of gene95 polypeptide 
include antibodies (polyclonal or monoclonal) which specifically bind to an 
1 0 epitope of said polypeptide. These specific compounds are compounds of the 
invention, and are useful for treating the diseases discussed previously. 
Design and manufacturing of such compounds are well known to those skilled 
in the art. 



15 Antibodies and antibody fragments that specifically recognize one or 

more epitopes of gene95, or epitopes of conserved variants of gene95, or 
peptide fragments of the gene95 are also encompassed by the invention. 
Such antibodies include but are not limited to polyclonal antibodies, 
monoclonal antibodies (MAbs), humanized or chimeric antibodies, single 

20 chain antibodies, Fab fragments, F(ab')2 fragments, fragments produced by a 
Fab expression library, anti-idiotypic (anti-Id) antibodies, and epitope-binding 
fragments of any of the above. These may be used to treat any of the 
disorders disclosed herein. 



25 As is known to those skilled in the art, use of fully humanized 

antibodies is generally preferred because they are generally less 
immunogenic and have a longer half-life when administered to humans. 
Leading available techniques include the UltiMab Human Antibody 
Development System(sm) featured by Medarex, Inc. (Princeton, NJ) which 

30 employs a transgenic mouse having a full suite of human genes for 
w antibodies; and the Xenomouse technology of Abgenix, Inc. (Fremont, CA) 

which uses genetically engineered strains of mice in which mouse antibody 



31 



WO 2004/003159 



PCT/US2003/020410 



gene expret 




is suppressed and functionally repla* 




human antibody 



gene expression, while leaving intact the rest of the mouse immune system. 

These mice can be induced to generate fully humanized polyclonal 
5 antibodies, which are then converted to large-scale production using 
hybridorna technology. In one embodiment, the method includes immunizing a 
transgenic non-human animal, e.g., a transgenic mouse, having a genome 
comprising a human heavy chain transgene and a human light chain 
transgene, with a purified or enriched preparation of the gene95 protein, or a 
10 fragment thereof. B cells (e.g., splenic B cells) of the animal are then 
obtained and fused with myeloma cells to form immortal, hybridorna cells that 
secrete human monoclonal antiboides against the gene95 protein. 



15 an immunogenic portion of an antibody with a corresponding, but non- 
immunogenic portion (i.e. chimeric antibodies) (Robinson, R. R. et al., 
International Patent Publication PCT/U.S.86/02269; Akira, K. et al., European 
Patent Application 184,187; Taniguchi, M., European Patent Application 
171,496; Morrison, S. L. et al., European Patent Application 173,494; 

20 Neuberger, M. S. et al., PCT Application WO 86/01533; Cabilly, S. et al., 
European Patent Application 125,023; Better, M. et al., Science 240:1041- 
1043 (1988); Liu, A. Y. et al. Proc. Natl. Acad. Sci. USA 84:3439-3443 (1987); 
Liu, A. Y. et al., J. Immunol. 139:3521-3526 (1987); Sun, L. K. et al., Proc. 
Natl. Acad. Sci. USA 84:214-218 (1987); Nishimura, Y. et al., Cane. Res. 

25 47:999-1005 (1987); Wood, C. R. et al., Nature 314:446-449 (1985)); Shaw et 
al., J. Natl. Cancer Inst. 80:1553-1559 (1988)). General reviews of 
"humanized" chimeric antibodies are provided by Morrison, S. L. (Science, 
229:1202-1207 (1985)) and by Oi, V. T. et al., BioTechniques 4:214 (1986). 
Suitable "humanized" antibodies can alternatively be produced by CDR or 

30 CEA substitution (Jones, P. T. et al., Nature 321:552-525 (1986); Verhoeyan 
et al., Science 239:1534 (1988); Beidler, C. B. et al., J. Immunol. 141:4053- 
4060 (1988)). 



Humanized antibodies may also be produced, for example by replacing 
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For the production of antibodies, various host animals may be 
immunized by injection with the gene95, a gene95 peptide (e.g., one 
corresponding the a functional domain of the protein), truncated gene95 
5 polypeptides (gene95 in which one or more domains has been deleted), 
functional equivalents of the gene95 or mutants of the gene95. Such host 
animals may include but are not limited to rabbits, mice, goats and rats, to 
name but a few. Various adjuvants may be used to increase the 
immunological response, depending on the host species, including but not 

10 limited to Freund's (complete and incomplete), mineral gels such as aluminum 
hydroxide, surface active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, 
dinitrophenol, and potentially useful human adjuvants such as BCG (bacille 
Calmette-Guerin) and Corynebacterium parvum. Polyclonal antibodies are 

1 5 heterogeneous populations of antibody molecules derived from the sera of the 
immunized animals. 

There are various expression systems that can be used for the 
production of whole antibodies and antibody fragments. These include 

20 bacterial or mammalian cell culture and transgenic animals or plants. The 
expression system of choice is determined by the intended application and 
the desired yield, as is known by those skilled in the art. For example, animal 
cell culture and transgenic expression systems are desirable if glycosylation 
of the antibody is required, whereas bacterial expression systems are more 

25 efficient for production of unglycosylated antibodies, Fab fragments and the 
like. See Chad, HE and Chamow, SM. 2001. Curr. Opin. Biotech. 12: 188- 
193. 

To generate fully human monoclonal antibodies to gene95, HuMab 
30 mice can be immunized with a preparation of gene95 or a fragment thereof, 
which has been purified or enriched according to any standard method. 
HuMab mice contain a human immunoglobulin gene miniloci that encodes 
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unrearrangl^human heavy (mu and gamma) 4ft kappa light chain 
immunoglobulin sequences, together with targeted muattions that inactivate 
the endogenous mu and kappa chain loci (Lonberg, N. et al. (1994) Nature. 
368(6474): 856-859.). These mice exhibit reduced expression of mouse IgM 
5 or kappa, and in response to immunization, the introduced human heavy and 
light chain transgenes undergo class switching and somatic mutation to 
generate high affinity human IgGkappa monoclonals. 

Preferably mice are 6 - 16 weeks of age upon the first immunization. 

10 Initial immunization is intraperitoneal (IP) with antigen in complete Freund's 
adjuvant, followed by every other week IP immunizations (up to a total of 6) 
with antigen in incomplete Freund's adjuvant The immune response can be 
monitored over the course of the immunization protocal with plasma samples 
obtained by retroorbital bleeds. The plasma can be screened, for example by 

1 5 ELISA or flow cytometry, and mice with sufficient titers of anti-gene95 human 
immunoglobulin can be used for fusions. Mice can be boosted intravenously 
with antigen 3 days before sacrifice and removal of the spleen. It is expected 
that 2-3 fusions for each antigen are needed for success. 



20 For generation of hybrodiomas producing human monoclonal 

antibodies to gene95, the mouse splenocytes are first isolated and fused with 
PEG to a mouse myeloma cell line based upon standard protocols. The 
resulting hybridomas are then screened for the production of antigen-specific 
antibodies. For example, single cell suspensions of splenic lymphocytes from 

25 immunized mice are fused to one-sixth the number of P3X63-Ag8.653 
nonsecreting mouse myeloma cells (ATCC, CRL 1580) with 50% PEG. Cells 
are plated at approximately 2 X 10 s in flat bottom microtiter plate, followed by 
a two week incubation in selective medium containing 20% fetal calf serum, 
18% "653" conditioned media, 5% origen (IGEN), 4 mM L-glutamine, 1 mM L- 

30 glutamine, 1 mM sodium pyruvate, 5 mM HEPES, 0.055 mM 2- 
mercaptoethanol, 50 unite/ml penicillin, 50 mg/ml streptomycin, 50 mg/ml 
gentamycin and 1X HAT (Sigma; the HAT is added 24 hours after the fusion). 
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After two wee(| cells are cultured in medium in whic^he HAT is replaced 
with HT. Individual wells are then screened by ELISA for human anti-gene95 
monoclonal IgM and IgG antibodies. Once extensive hybridoma growth 
occurs, medium is observed usually after 10-14 days. The antibody secreting 
5 hybridomas are replated, screened again, and if still positive for human IgG, 
anti-gene95 antibodies, can be subcloned at least twice by limiting dilution. 
The stable subclones are then cultured in vitro to generate small amounts of 
antibody in tissue culture medium for characterization. 



10 To purify human anti-gene95 antibodies, selected hybridomas can be 

grown in two-litre spinner-flasks for monoclonal antibody purification. 
Supernatants can be filtered and concentrated before affinity chromatography 
with protein A-sepharose. (Pharmacia, Piscataway, NJ). Eluted IgG can be 
checked by gel electrophoresis and high performance liquid chromatography 

15 to ensure purity. The buffer solution can be exchanged into PBS, and the 
concentration can be determined by OD 2 ao using 1.43 extinction co-efficient. 
The monolconal antibodies can be aliquoted and stored at -80°C until 
required for use according to the methods of the invention. 

20 Another composition which can serve to increase HDL levels and/or 

treat the diseases discussed herein, is a nucleic acid construct comprising a 
gene95 gene and combined into a gene-therapy delivery vehicle. Such 
vehicles may be simple aqueous buffers (for naked DNA delivery), viral based 
vehicles or non-viral (especially lipid-based) vehicles. These specific 

25 compounds are compounds of the invention, and are useful for treating the 
diseases discussed previously. Design and manufacturing of such 
compounds easily achieved by those skilled in the art based on the instant 
specification. 

30 Agonists, antagonists, or mimetics found to be effective at modulating 

the level of cellular expression or activity may be confirmed as useful in 
animal models (for example, primates, mice, pigs, dogs, rabbits, or chickens). 
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For example 



!e compound may ameliorate the low 



levels of mouse or 



chicken hypoalphalipoproteinemias. 

A compound that promotes an increase in expression or activity of a 
5 gene or polypeptide that has the function of facilitating HDL production is 
considered particularly useful in the invention; such a molecule may be used, 
for example, as a therapeutic to increase the level or activity of HDL and 
thereby treat a low HDL condition in an animal (for example, a human). 

10 The invention permits the identification of other HDL-related genes 

that may play a role in HDL-activity regulation and thus mutations in such 
genes may be important in studying other genes that are involved in HDL 
levels. Thus, for example, mutations that serve to stabilize gene95 proteins 
that otherwise contribute to elevating HDL levels have value. Such mutations 

15 can be incorporated into any protein therapy or gene therapy undertaken for 
the treatment of low HDL-C levels. Similarly, compounds that increase the 
stability of a wild-type gene95 polypeptide or decrease its catabolism may 
also be useful for the treatment of low HDL-C or any other condition resulting 
from low HDL levels. Mutant forms of gene95 identified herein are also useful 

20 for this purpose. Further mutations and compounds can be identified using 
the methods described herein. 

In one such embodiment, cells expressing a gene95 polypeptide 
having a mutation are transiently metabolically labeled during translation and 

25 the half-life of the said polypeptide is determined using standard techniques. 
Mutations that increase the half-life of the polypeptide are ones that increase 
protein stability. These mutations can then be assessed for HDL-related 
biological activity. They can also be used to identify proteins that affect the 
stability of corresponding mRNA or other proteins. One can then assay for 

30 compounds that act on these factors or on the ability of these factors to bind 
such polypeptides. 
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example, ceHs expressing a wild-tyj^^ene95 polypeptide 



In anol 




are transiently metabolically labeled during translation, contacted with a 
candidate compound, and the half-life of the polypeptide is determined using 
standard techniques. Compounds that increase the half-life of the polypeptide 
5 are useful compounds in the present invention. 

In other embodiments, treatment with an agonist of the invention may 
be combined with any other HDL-raising or anti-CVD or anti-dyslipidemia 
therapies, or other conditions which require HDL management. 



The assays of the invention comprise protein based assays. The 
polypeptide encoded by the gene95 gene (purified or unpurified) can be used 
in an assay to determine its ability to bind another protein (including, but not 
limited to, proteins found to specifically interact with proteins known to be 
15 involved in HDL-activity. Some examples would include ABCA1 protein and 
apolipoprotein. The effect of a compound on that binding is then determined. 

In another type of binding assay for which the polypeptides encoded by 
genes disclosed herein are useful, the gene95 polypeptide (or a polypeptide 

20 fragment thereof or an epitope-tagged form or fragment thereof) is harvested 
from a suitable source (e.g., from a prokaryotic expression system, eukaryotic 
cells, a cell-free system, or by immunoprecipitation from expressing cells). 
The polypeptide is then bound to a suitable support (e.g., nitrocellulose or an 
antibody or a metal agarose column in the case of, for example, a his-tagged 

25 form of said polypeptide). Binding to the support is preferably done under 
conditions that allow proteins associated with the polypeptide to remain 
associated with it. Such conditions may include use of buffers that minimize 
interference with protein-protein interactions. The binding step can be done in 
the presence and absence of compounds being tested for their ability to 

30 interfere with interactions between said polypeptide and other molecules. If 
desired, other proteins (e.g., a cell lysate) are added, and allowed time to 
associate with the polypeptide. The immobilized polypeptide is then washed 
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to remove f^eins or other cell constituents that^lry be non-specifically 
associated with it the polypeptide or the support. The immobilized 
polypeptide is then dissociated from its support, and proteins bound to it are 
released (for example, by heating), or, alternatively, associated proteins are 
5 released from the polypeptide without releasing the latter polypeptide from the 
support. The released proteins and other cell constituents can be analyzed, 
for example, by SDS-PAGE gel electrophoresis, Western blotting and 
detection with specific antibodies, phosphoamino acid analysis, protease 
digestion, protein sequencing, or isoelectric focusing. Normal and mutant 
1 0 forms of such polypeptide can be employed in these assays to gain additional 
information about which part of the polypeptide a given factor is binding to. In 
addition, when incompletely purified polypeptide is employed, comparison of 
the normal and mutant forms of the protein can be used to help distinguish 
true binding proteins. 

15 

In a specific example of such an assay, such assay is performed using 
a purified or semipurified protein or other molecule that is known to interact 
with a polypeptide encoded by a polynucleotide corresponding to gene95. 
This assay may include the following steps. 
20 1. Harvest the gene95 polypeptide and couple a suitable fluorescent 

label to it; 

2. Label an interacting protein (or other molecule) with a second, 
different fluorescent label. Use dyes that will produce different quenching 
patterns when they are in close proximity to each other vs. when they are 

25 physically separate (i.e., dyes that quench each other when they are close 
together but fluoresce when they are not in close proximity); 

3. Expose the interacting molecule to the immobilized polypeptide in 
the presence or absence of a compound being tested for its ability to interfere 
with an interaction between the two; and 

30 4. Collect fluorescent readout data. 
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AnothSWTssay is includes Fluorescent Resonance Energy Transfer 
(FRET) assay. This assay can be performed as follows. 

1. Provide the gene95 protein or a suitable polypeptide fragment 
thereof and couple a suitable FRET donor (e.g.,. nitro-benzoxadiazole (NBD)) 

5 to it; 

2. Label an interacting protein (or other molecule) with a FRET 
acceptor (e.g., rhodamine); 

3. Expose the acceptor-labeled interacting molecule to the donor- 
labeled polypeptide in the presence or absence of a compound being tested 

10 for its ability to interfere with an interaction between the two; and 

4. Measure fluorescence resonance energy transfer. 

Quenching and FRET assays are related. Either one can be applied in 
a given case, depending on which pair of fluorophores is used in the assay. 

Gene95 may act by altering membrane permeability, such as the 
permeability of membranes to ions. Such activity may be assayed for using 
vesicles, such as liposomes or intact cells, wherein such structures comprise 
gene95 polypeptides of the invention, which polypeptides are expressed in 

20 such vesicle, preferably an intact cell, such as a mammalian recombinant cell, 
and the permeability of the membrane of the cell is determined in the 
presence or absence of such expression. In the same way, such permeability 
can then be assayed in the presence and absence of chemical agents known 
to modulate the activity of gene95. Thus, the utility of these agents in 

25 enhancing the activity of proteins known to affect such membrane transport 
can be readily determined. In the same way, the ability of these agents to 
affect the transport of other molecules, such as lipids, especially HDL, across 
such membranes is likewise readily determined. 

30 In performing such assays, the test cell, such as the aforementioned 

mammalian recombinant cell expressing gene95, or a polynucleotide 
corresponding to such gene is loaded with a reporter molecule (such as a 
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fluorescent (J^ndicator whose fluorescent propertie^tange when it binds a 
particular ion) that can detect ions, or alternatively, the external medium is 
loaded with such a molecule. A molecule which exhibits differential properties 
when it is inside the vesicle compared to when it is outside the vesicle may be 
5 used. For example, a molecule that has quenching properties when it is at 
high concentration but not when it is at another low concentration would be 
suitable. Alternatively, the effect of transport of ions (Ca, K, Na, metals and 
the like) across the membrane can be measured if the effect is to quench or 
unquench a dye which has been loaded into the medium or into the cell. The 
10 movement of the charged molecule (either its ability to move or the kinetics of 
its movement) in the presence or absence of a compound being tested for its 
ability to affect this process can be determined. 

In still another assay, uptake of radioactive isotopes into or out of a 
15 vesicle can be measured. The vesicles are separated from the extravesicular 
medium and the radioactivity in the vesicles and in the medium is quantitated 
and compared. 



As already disclosed above, the present invention also relates to 
20 assays that may employ transcription factors for one or more of the genes 
disclosed herein. The association between the polynucleotide to be tested 
and the binding factor may be assessed by means of any system that 
discriminates between protein-bound and non-protein-bound DNA (e.g., a gel 
retardation assay). The effect of a compound on the binding of a factor to 
25 such DNA is assessed by means of such an assay. In addition to binding 
assays, in vitro assays in which the regulatory regions of the gene are linked 
to reporter genes can also be performed. In a preferred embodiment, such 
polynucleotide is one that corresponds to gene95, most preferably wherein 
said polynucleotide has SEQ ID NO. 3. 

30 

The invention includes recombinant cell lines containing a gene95 
gene construct. Recombinant cell lines expressing the corresponding protein 



40 



WO 2004/003159 PCT/US 2003/0204 10 

are tested to^(feitify a relevant biological activity of ^^protein that can be 
modulated by exposure to test compound(s). Compound(s) are systematically 
screened to evaluate whether they modulate the identified biological activity 
and those that effectively do so are then therapeutic agents, or analogs 
5 thereof, according to the invention. 



Recombinant cell lines are also preferred for the preparation of purified 
protein, if a purified protein assay is desired. Those skilled in the art are 
capable of producing recombinant cell lines and extracting protein fractions 
10 containing highly purified proteins. These samples can be used in a variety of 
binding assays to identify compounds which interact with the proteins. 
Compounds that interact are therapeutic agents of the invention, or analogs 
thereof. 

15 The invention also comprises the upstream untranslated regions and 

promoter regions of gene95. A segment of the promoter region is included in 
Figure 5. Larger portions of this promoter region can be now be identified by 
those skilled in the art, using this disclosure. The 5'UTR (untranslated region) 
is disclosed in SEQ ID NO. 3, 6, 8 and Figure 5. Such genomic or 

20 untranslated regions may be included in plasmids which are used in assays to 
identify compounds which modulate the expression of the identified gene. In 
one such assay, the upstream genomic region is ligated to a reporter gene, 
and incorporated into an expression plasmid. The plasmid is transfected into a 
cell, and the recombinant cell is exposed to test compound(s). Those 

25 compounds which increase or decrease the expression of the reporter gene 
are then modulators of the gene/protein, and are considered therapeutic 
agents of the invention. 



The genes disclosed herein may also be involved in regulating 
30 cholesterol efflux. A transport-based assay for such activity can be performed 
in vivo or in vitro. For example, the assay may be based on any part of the 
reverse cholesterol transport process that is readily re-created in culture, such 
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as cholesteraBor phospholipid efflux. Alternatively, fflBissay may be based 
on net cholesterol transport in a whole organism, as assessed by means of a 
labeled substance (such as cholesterol). 



5 For high throughput screening, fluorescent lipids can be used to 

measure lipid efflux. For phospholipids, a fluorescent precursor, C6-NBD- 
phosphatidic acid, can be used. This lipid is taken up by cells and 
dephosphorylated by phosphatide acid phosphohydrolase. The product, 
NBD-diglyceride, is then a precursor for synthesis of glycerophospholipids like 

0 phosphatidylcholine. The efflux of NBD-phosphatidylcholine can be monitored 
by detecting fluorescence resonance energy transfer (FRET) of the NBD to a 
suitable acceptor in the cell culture medium. Suitable acceptors include 
rhodamine-labeled phosphatidylethanolamine, a phospholipid that is not 
readily taken up by cells. The use of short-chain precursors obviates the 

5 requirement for the phospholipid transfer protein in the media. For cholesterol, 
NBD-cholesterol ester can be reconstituted into LDL. The LDL can efficiently 
deliver this lipid to cells via the LDL receptor pathway. The NBD-cholesterol 
esters are hydrolyzed in the lysosomes, resulting in NBD-cholesterol that can 
now be transported back to the plasma membrane and efflux from the cell. 

0 The efflux can be monitored by the aforementioned FRET assay in which 
NBD transfers its fluorescence resonance energy to the rhodamine- 
phosphatidylethanoline acceptor. 

Animal models are also useful in the assays of the invention. 

15 Compounds identified as having activity in any of the above-described assays 
are subsequently screened in any available animal model system, including, 
but not limited to, pigs, rabbits, and chickens, to see if HDL levels are 
elevated as a result of such compounds being administered in an 
appropriately effective amount to said animal. Test compounds are 

>0 administered to these animals according to standard methods. Test 
compounds may also be tested in mice bearing mutations in interfere with 
cholesterol transport. Additionally, compounds may be screened for their 
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* 





ability to enlWe an interaction between a polyp^rode encoded by a 
polynucleotide corresponding to gene95 and any HDL particle constituent 
such as ApoAl, ApoAl I, or ApoE. 



cholesterol to an extracellular acceptor molecule and are usually dependent 
on the presence of a transporter molecule, such as ABCA1. Thus, the genes 
disclosed herein may play a role in modulating the activity of ABCA1, such as 
where a polypeptide encoded by a polynucleotide disclosed herein binds to, 

10 and modulates, the activity of ABCA1 or other HDL-related protein. In this 
procedure, cells are loaded with radiolabeled cholesterol by any of several 
biochemical pathways (Marcil et al., Arterioscler. Thromb. Vase. Biol. 19:159- 
169, 1999). Cholesterol efflux is then measured after incubation for various 
times (typically 0 to 24 hours) in the presence of HDL3 or purified ApoAl. 

1 5 Cholesterol efflux is determined as the percentage of total cholesterol in the 
culture medium after various times of incubation. Compounds that modulate 
cholesterol efflux in this assay may be acting on the gene95 protein target. 
Preferably, the cell is a recombinant cell line containing a gene95 construct. 

20 The invention also comprises a transgenic animal, such as a mouse, 

that has had one or both alleles of gene95 inactivated (e.g., by homologous 
recombination) or conversely, a mouse having one or more additional copies 
of gene95 or additional mutant copies of gene95. Such mice, including 
transgenic mice, represent a useful animal model for screening for 

25 compounds that raise HDL-C levels. Such mice are also useful for 
experimental purposes to determine the role of gene95 in living systems. 
Such an animal can be produced using standard techniques. In addition to the 
initial screening of test compounds, the animals having different versions of 
such genes are useful for further testing of efficacy and safety of drugs or 

30 agents first identified using one of the other screening methods described 
herein. Cells taken from the animal and placed in culture can also be exposed 



5 



Such cholesterol efflux assay measures the ability of cells to transfer 



43 



WO 2004/003159 



PCT/US2003/020410 



to test coi 




nds. HDL-C levels can be me: 



ed using standard 



techniques, such as those described herein. 

Compounds first identified as useful in elevating HDL activity using one 
5 or more of the assays of the invention may be administered with a 
pharmaceutically-acceptable diluent, carrier, or excipient, in unit dosage form. 
Conventional pharmaceutical practice may be employed to provide suitable 
formulations or compositions to administer such compositions to patients. 
Although oral administration is preferred, any appropriate route of 

10 administration may be employed, for example, parenteral intravenous, 
subcutaneous, intramuscular, intracranial, intraorbital, ophthalmic, 
intraventricular, intracapsular, intraspinal, intracistemal, intraperitoneal, 
intranasal, aerosol, or oral administration. Therapeutic formulations may be in 
the form of liquid solutions or suspension; for oral administration, formulations 

15 may be in the form of tablets or capsules; and for intranasal formulations, in 
the form of powders, nasal drops, or aerosols. 

Typically, a successful gene95 modulating therapeutic agent will meet 
some or all of the following criteria. Oral availability should be at or above 

20 20% (so that at least 20% is absorbed into the blood stream). Animal model 
efficacy should be less than about 2 mg/kg and the target human dose 
between 50 and 250 mg/70 kg, although doses outside of this range may be 
acceptable. The therapeutic index (or ratio of toxic dose to therapeutic dose) 
should be greater than 100. The potency (as expressed by IC50 value) should 

25 be less than about 10 jxM, preferably below 1 pM, and most preferably below 
100 nM. The agent should show reversible binding (i.e., competitive inhibition) 
and should be reasonably selective over unrelated targets, or related targets 
that cause unwanted side effects. The required dosage should be no more 
than about once or twice a day or at meal times. 



Methods well known in the art for making formulations are found in, for 
example, Remington: The Science and Practice of Pharmacy . (19th ed.) ed. 
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10 



15 



20 



25 



A.R. Gennar^AR., 1995, Mack Publishing Company, Easton, PA. 
Formulations for parenteral administration may, for example, contain 
excipients, sterile water, or saline, polyalkylene glycols such as polyethylene 
glycol, oils of vegetable origin, or hydrogenated napthalenes. Biocompatible, 
biodegradable lactide polymer, lactide/glycolide copolymer, or 
polyoxyethylene-polyoxypropylene copolymers may be used to control the 
release of the compounds. Other potentially useful parenteral delivery 
systems for agonists of the invention include ethylenevinyl acetate copolymer 
particles, osmotic pumps, implantable infusion systems, and liposomes. 
Formulations for inhalation may contain excipients, or example, lactose, or 
may be aqueous solutions containing, for example, polyoxyethylene-9-lauryl 
ether, glycocholate and deoxycholate, or may be oily solutions for 
administration in the form of nasal drops, or as a gel. 

In general, novel drugs for the treatment of aberrant cholesterol levels 
and/or CVD or dyslipidemia are identified from large libraries of both natural 
product or synthetic (or semi-synthetic) extracts or chemical libraries including 
combinatorial chemical libraries, according to methods known in the art. 
Those skilled in the field or drug discovery and development will understand 
that the precise source of test extracts or compounds is not critical to the 
screening procedure(s) of the invention. Accordingly, virtually any number of 
chemical extracts or compounds can be screened using the exemplary 
methods described herein. Examples of such extracts or compounds include, 
but are not limited to, plant-, fungal-, prokaryotic- or animal-based extracts, 
fermentation broths, and synthetic compounds, as well as modification of 
existing compounds. Numerous methods are also available for generating 
random or directed synthesis (e.g., semi-synthesis or total synthesis) of any 
number of chemical compounds, including, but not limited to, saccharide-, 
lipid-, peptide-, and nucleic acid-based compounds. Synthetic compound 
libraries are commercially available from Brandon Associates (Merrimack, 
NH) and Aldrich Chemical (Milwaukee, Wl). Alternatively, libraries of natural 
compounds in the form of bacterial, fungal, plant, and animal extracts are 
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commercial!' 




ailable from a number of sources, ii 




ing Biotics (Sussex, 



UK), Xenova (Slough, UK), Harbor Branch Oceangraphics Institute (Ft 
Pierce, FL), and PharmaMar, U.S.A. (Cambridge, MA). In addition, natural 
and synthetically produced libraries are produced, if desired, according to 
5 methods known in the art, e.g., by standard extraction and fractionation 
methods. Furthermore, if desired, any library or compound is readily modified 
using standard chemical, physical, or biochemical methods. 



10 readily understand that methods for dereplication (e.g., taxonomic 
dereplication, biological dereplication, and chemical dereplication, or any 
combination thereof) or the elimination of replicates or repeats of materials 
already known for their HDL-raising, anti-CVD and/or anti-dyslipidemia 
activities should be employed whenever possible. 



When a crude extract is found to have cholesterol-modulating or anti- 
CVD activities, or both, further fractionation of the positive lead extract is 
necessary to isolate chemical constituent responsible for the observed effect. 
Thus, the goal of the extraction, fractionation, and purification process is the 

20 careful characterization and identification of a chemical entity within the crude 
extract having cholesterol-modulating or anti-CVD activities. The same in vivo 
and in vitro assays described herein for the detection of activities in mixtures 
of compounds can be used to purify the active component and to test 
derivatives thereof. Methods of fractionation and purification of such 

25 heterogeneous extracts are known in the art. If desired, compounds shown to 
be useful agents for the treatment of pathogenicity are chemically modified 
according to methods known in the art. Compounds identified as being of 
therapeutic value are subsequently analyzed using any standard animal 
model known in the art. 



The invention further comprises the use of gene95 protein or protein 
fragments as therapeutic agents, and includes any modified forms of gene95 



In addition, those skilled in the art of drug discovery and development 
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which retain w^biological activity of gene95. These mlrapeutic agents can 
be administered to patients in need thereof, such as patients who would 
benefit by increasing gene95 biological activity either locally, regionally or 
throughout the body, as administered. Such patients include, but are not 
5 limited to, patients suffering from low HDL, dyslipidemia, cardiovascular 
disease and other disorders. 



Functional gene95 protein, and soluble derivatives thereof, such as 
peptides corresponding to functional domains of gene95 or modified versions 
10 of gene95 protein, can be produced by standard methods known in the art. 
Administration to patients can be intravenous, oral, intraperitoneal, or by any 
other method known in the art. 

Diagnostics and Pharmacogenomics with qene95 

15 

Gene expression, both comparable and absolute, as well as biological 
activity, and mutational analysis can each serve as a diagnostic tool for low 
HDL; thus determination of the genetic subtyping of the gene sequence can 
be used to subtype low HDL individuals or families to determine whether the 

20 low HDL phenotype is related to a given protein function, as with the families 
used in accordance with the present invention. This diagnostic process can 
lead to the tailoring of drug treatments according to patient genotype, 
including prediction of side effects upon administration of HDL increasing 
drugs (referred to herein as pharmacogenomics). Pharmacogenomics allows 

25 for the selection of agents (e.g., drugs) for therapeutic or prophylactic 
treatment of an individual based on the genotype of the individual (e.g., the 
genotype of the individual is examined to determine the ability of the individual 
to respond to a particular agent). 

The present invention further relates to processes diagnosing the 
30 presence of a disorder, especially at its earliest stages, or the risk to a patient 
of developing such disorders. Thus, in another aspect, the present invention 
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relates to 




>ss for diagnosing the presence 



• 



disorder in a patient 



suspected of being afflicted therewith comprising detecting a mutation in the 
gene95 gene in the genome of said patient. In a preferred embodiment, the 
present invention relates to such diagnostic processes where the mutation is 
5 the mutation in the nucleotide sequence as depicted in SEQ ID NO: 6 and/or 
SEQ ID NO: 8 and/or Table 1. 

In particular embodiments of such a process, the mutation is detected 
in a sample of DNA taken from said patient. Such a sample may be obtained 
in any manner commonly used to obtain such sample, such as by blood 

1 0 sample, biopsy or other methods. Further in accordance with the processes of 
the invention, such detecting may be accomplished in vivo, such as where an 
in situ procedure is used to detect the presence of a mutated gene95 gene or 
nucleotide sequence in a patient. The disorders that can be so detected are 
not limited to low HDL, dyslipidemia, cardiovascular disease, and the other 

1 5 disorders thereby included, disclosed previously 

In accordance with the foregoing, the present invention relates to a 
process for diagnosing the presence of an gene95-linked disease in a patient 
suspected of being afflicted therewith comprising detecting a mutation in the 
gene95 gene in the genome or mRNA of said patient. 

20 In preferred embodiments, the invention relates to such a process 

wherein said mutation is detected in a sample of DNA taken from the patient 
most preferably wherein the mutation is the mutation of SEQ ID NO. 6 or SEQ 
ID NO. 8 or Table 1. 



25 absence of such a mutation. In preferred embodiments, said detecting is 
accomplished by determining the ability of a nucleic acid probe comprising at 
least 15 contiguous nucleotides that are complementary to a mutated portion 
of the sequence of SEQ ID NO: 1, most preferably wherein said probe 
comprises at least 30 contiguous nucleotides, especially where the probe 

30 comprises at least 50 contiguous nucleotides. 



Many different techniques are known for detecting the presence or 
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The pl^fsses disclosed herein are also usefuWTdetermining the risk 
of a patient in developing one or more of the above disorders, so that the 
present invention also relates to a process for determining a patient's risk of 
developing a disorder where said patient is suspected to be at risk thereof, 
5 comprising detecting a mutation in the gene95 gene in the genome or mRNA 
of said patient. Such detection may, in keeping with the foregoing, be by 
either in vitro or in vivo means. In a preferred embodiment, said disorder is 
low HDL, dyslipidemia and/or cardiovascular disease. In a preferred 
embodiment, the mutation is a mutation in the gene corresponding to SEQ ID 
10 NO: 1. 



It is equally an embodiment of the invention to diagnose a disorder 
using detection of the gene95 protein in a tissue or body sample, especially 
the detection of mutant gene95 protein or the level of mutant or normal 
15 gene95 protein. Such detection may be achieved by a variety of methods, but 
typically involves use of an anti-gene95 antibody in a standard ELISA assay. 
On the basis of this disclosure, those skilled in the art may now develop 
diagnostic tools for detecting mutant or normal gene95 in patient samples, for 
the diagnosis of various disorders. 

20 

Agents, or modulators* having a stimulatory or inhibitory effect on the 
gene95 gene, or gene product, can be administered to individuals to treat 
disorders associated with low HDL-activity or level. In conjunction with such 
treatment, the pharmacogenomics (i.e., the study of the relationship between 

25 an individual's genotype and that individual's response to a foreign compound 
or drug) of the individual may be considered. Differences in efficacy of 
therapeutics can lead to severe toxicity or therapeutic failure by altering the 
relation between dose and blood concentration of the pharmacologically 
active drug. Thus, the pharmacogenomics of the individual permits the 

30 selection of effective agents (e.g., drugs) for prophylactic or therapeutic 
treatments based on a consideration of the individual's genotype. Such 
pharmacogenomics can further be used to determine appropriate dosages 
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and therai 



ic regimens. Accordingly, the gei 



complement of an 



individual with respect to the genes disclosed herein can be determined to 
thereby select appropriate agent(s) for therapeutic or prophylactic treatment of 
the individual. 



Pharmacogenomics deals with clinically significant hereditary variations 
in the response to drugs due to altered drug disposition and abnormal action 
in affected persons (Eichelbaum, M., Clin. Exp. Pharmacol. Physiol., 23:983- 
985, 1996; Under, M. W., Clin. Chem., 43:254-266, 1997). In general, two 

1 0 types of pharmacogenetic conditions can be differentiated. Genetic conditions 
transmitted as a single factor altering the way drugs act on the body (altered 
drug action) or genetic conditions transmitted as single factors altering the 
way the body acts on drugs (altered drug metabolism). Altered drug action 
may occur in a patient having a polymorphism (e.g., an single nucleotide 

1 5 polymorphism or SNP) in promoter, intronic, of exonic sequences of gene95. 
Thus by determining the presence and prevalence of polymorphisms allow for 
prediction of a patient's response to a particular therapeutic agent. In 
particular, polymorphisms in the promoter region may be critical in 
determining the risk of HDL deficiency and CVD. In addition to mutations, 

20 polymorphisms of gene95 may likewise be relevant to the identification and 
use of agents identified using the assays disclosed herein. 

In addition, different agents may have different abilities to affect the 
genes of a signature gene set. For example, if a potential therapeutic agent, 

25 say, agent A, causes a gene, or group of genes, related to low HDL levels to 
exhibit decreased expression, such as where a lower amount of mRNA is 
expressed from said gene(s), or less protein is produced from said mRNA, but 
a second potential agent, say, agent B, while modulating the activity of the 
same or related genes causes said expression to be reduced to half, such as 

30 where only half as much mRNA is transcribed or only half as much protein is 
translated from said mRNA as for agent A, then agent B is considered to have 
twice as much therapeutic potential as agent A. 



5 
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The present invention also relates to a process that comprises a 
method for producing a product, including the generation of test data, 
comprising identifying an agent according to one of the disclosed processes 
5 for identifying such an agent (i.e., the therapeutic agents identified according 
to the assay procedures disclosed herein) wherein said product is the data 
collected with respect to said agent as a result of said identification process, 
or assay, and wherein said data is sufficient to convey the chemical character 
and/or structure and/or properties of said agent. For example, the present 

10 invention specifically contemplates a situation whereby a user of an assay of 
the invention may use the assay to screen for compounds having a desired 
polypeptide, for example, enzyme, modulating activity, or gene expression 
modulating activity, and, having identified the compound, then conveys that 
information (i.e., information as to structure, dosage, etc) to another user who 

15 then utilizes the information to reproduce the agent and administer it for 
therapeutic or research purposes according to the invention. For example, the 
user of the assay (user 1) may screen a number of test compounds without 
knowing the structure or identity of the compounds (such as where a number 
of code numbers are used the first user is simply given samples labeled with 

20 said code numbers) and, after performing the screening process, using one or 
more assay processes of the present invention, then imparts to a second user 
(user 2), verbally or in writing or some equivalent fashion, sufficient 
information to identify the compounds having a particular modulating activity 
(for example, the code number with the corresponding results). This 

25 transmission of information from user 1 to user 2 is specifically contemplated 
by the present invention. 

In accordance with the foregoing, the present invention relates to a 
method for producing test data with respect to the modulating activity of a test 
30 compound comprising: 

(a) contacting a test compound with a Gene95 polynucleotide or 
Gene95 polypeptide, or a polynucleotide construct comprising a reporter gene 
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operably li^^J to a Gene95 promoter, under itions wherein said 
polynucleotide or reporter gene is being transcribed or said polypeptide is 
active, 

(b) determining a change in the activity of said polypeptide or 
5 transcription of said polynucleotide or reporter gene as a result of said 

contacting, and 

(c) producing test data with respect to the modulating activity of said 
test compound based on a change in the transcription of the determined 
polynucleotide or reporter gene or activity of the determined polypeptide 

10 wherein said change shows modulating activity. 



Example 1 

Mutations in Gene95 cause low HDL in humans 

15 

METHODS 

Patient Ascertainment and Sample Collection 

Kindred BC11M, of Dutch origin was identified as part of a collaboration 
between the Department of Medical Genetics at the University of British 

20 Columbia and St Paul's Hospital in Vancouver, British Columbia. This kindred 
was ascertained based on a hypoalphalipoproteinemia phenotype (<5 th 
percentile) in the absence of other lipid and clinical abnormalities (e.g. obesity 
and diabetes). A history of coronary artery disease was evident in this kindred 
at the time of ascertainment. Additional Dutch kindreds NL-003, NL-120, were 

25 ascertained as part of the patient referral program to the lipid clinic at the 
Academic Medical Center in Amsterdam based on abnormal cholesterol 
levels. A control cohort of normolipidemic individuals (HDL >20 th percentile 
and <80 th percentile) was used to validate the causal nature of the mutation. 

30 Markers and Genotyping 
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Information ^Plrding genetic markers was obtain^^from the Genome 
Database and GenBank. All markers used for genome scan and fine mapping 
had been previously localized on the Marshfield map. An V was appended to 
many of the marker names for internal use. Genomic DNA was genotyped 
5 using an Applied Biosystems Prism 3100 Genetic Analyzer running 
GeneMapper software (Applied Biosystems). Mendelian inheritance of alleles 
was verified using the PedCheck program. 



Linkage Analyses 

10 Two-point linkage analysis was performed using the MUNK program v4.1p 
from the FASTLINK software package and multipoint linkage analysis was 
performed using VITESSE v. 1.0. An autosomal dominant inheritance model 
was used, assuming that 10% of all low HDL (<5 th percentile) in the population 
is due to a mutation in the gene. The disease allele frequency was assumed 

15 to be 0.25%, with 98% penetrance and a phenocopy rate of 4.5% in non- 
carriers. Marker allele frequencies were estimated from a large collection of 
American and European samples. Haplotypes were manually reconstructed 
and cross-validated using SIMWALKv2.8. 



20 RESULTS 

Linkage analysis of family BC11M: 

Twenty-four individuals from family BC11M were included in a 400 marker 
genome scan of 12 multiplex pedigrees segregating isolated 
hypoalphalipoproteinemia in a dominant fashion. Power analysis suggested 

25 that the BC1 1 M kindred was suitable to identify an HDL locus using a linkage 
based approach under an autosomal dominant model (MaxE (Zmax 5cM) = 
4.762). Two point linkage analysis of genome scan data gave a suggestive 
LOD score on chromosome 9 (D9S1677 LOD =2.285). This LOD score was 
followed up by multipoint linkage analysis, saturation genotyping and 

30 haplotyping with additional markers on an expanded pedigree. The region on 



53 



WO 2004/003159 PCT/US2003/020410 



to -a 

1 « 



I 

CIS 

c 

3 



T3 

« 

CD 

i I 

to x: 

i 1 

sj= to 

CD CO 

§ & 

-o 8> 

<D tO 

8 « 



CO 



CO 

CD 



<= a> 
o 

co 2 

o -o 

rt v CO 

g O 

CD O 

co a> 

"O CD 

1 1 

5 I 

csi S 

■S o 

l — ca 



in 
o 




u 



o 



o 



CM 
CO 



CO 

s 

T— I 

CO 
o 



14.039 1 


14.441 


14.742 


16.265 


16.465 


CN 




CO 






O 


o 


CM 


CN 






CO 


in 


O 


o 


o 


o 








D9S167 


D9S152X 


D9S1877X 


D9S1790X 


D9S1120X 



54 



WO 2004/003159 PCT/US2003/020410 







Interval 1 


s 
o 
m 

CO 

CM 
CM 














































| Interval 3 | 


u 
oo 














| Interval 2 | 


|4 .29 cM | 



































































































































































o 







































17.67 | 


cn 
o 

cn 

rH 


crv 
o 

rH 


t-l 

o 

CM 


r- 
o 
cn 

o 

CM 


cn 
o 

CO 

rH 

CM 


CO 
LT> 
rH 

in 

CM 


CO 

in 

00 

in 

CM 


CO 

m 
in 

CM 


m 
m 

CM 


00 

in 

CO 

VO 
CM 


00 

in 

VO 
CM 


oo 
m 
in 

vo 

CM 


r- 

VD 

r- 

CM 


VD 

o 

CO 


VD 

vo 
o 

CO 




in 

CM 

o 

VD 
CO 


CM 

vo 
cn 

r- 
co 


m 

VD 

oo 

CO 


CO 

r- 

CM 


rH 
rH 
O 

OO 


1.42 


o 


rH 
O 

r-l 


O 
00 


CM 

o 
o 


"3* 

oo 

CO 


CM 

o 


rH 

o 


rH 

o 

CO 

o 


o 
vo 

o 


rH 
O 


rH 
O 


CM 

rH 
rH 

rH 


VO 
CM 


CM 

o 


cn 
r- 
co 

m 




r- 
co 
cn 

rH 


CO 

o 
m 

o 


00 

r* 

CM 


GO 
VD 
CM 

in 


o 


D9S181Z 


m 
m 

CM 
CO 

cn 
Q 


CO 

CM 
CO 
CXi 
Q 


■5 

co 
cn 
Q 


co 

CD 
CM 
CO 

cn 
o 


o 

CM 
00 

T-l 

CO 

cn 
Q 


D9S1803X 


r— 
cn 

rH 
CO 

cn 
a 


X 
vo 
cn 

rH 
CO 

cn 

Q 


D9S1689X 


o 
i 

CM 
CM 

cr 
cn 


VO 

m 
a 
I 

CM 
CM 

tr 

cn 


rH 
CM 
nJ 
O 
1 

X 

CM 
CM 

cr 

cn 


r— 
oo 

CM 
CO 

cn 
Q 


VD 

rH 

Cti 
O 

CM 
CM 

cr 

cn 


rH 

o 
1 

X 
CM 
CM 

cr 
1 °^ 


o 
cn 

VD 
rH 
CO 

cn 
a 


rH 

r- 

CM 
CO 

cn 
O 


X 
VD 
O 
CO 
CO 

on 
a 


D9S1784X 


r* 
r- 
vo 

tH 
CO 

cn 
Q 


cn 

CO 
CM 
CO 

cn 
Q 



55 



WO 2004/003159 PO7US2003/020410 

• * 

chromosome^r was supported by multipoint WnkagF analysis, and the 
multipoint LOD rose to 8.0, defining an affecteds only interval of 20.91 cM 
(Interval 1; D9S1780 to 9q22xca17) (Table 2). This interval could be further 
into sub-regions based on chromosomal breakpoints in unaffected individuals 
5 (Interval 2; 9q22xca7 to 9q22x-ca17) (Interval 3; D9S1812 to D9S1803x) 
(Table 2). Interval 2 was the focus for the most comprehensive mutation 
detection effort. 



Example 2 

0 Tissue distribution of Gene 95 using an Origene cDNA panel. 

Twenty-four serially diluted (4-log range to ensure linear amplification) 
cDNA clones arrayed into a multi-well PCR plate were obtained from Origene 
(Rockville, MD). PCR was performed using gene-specific primers for Gene 
5 95 (sometimes "G95" herein). Reactions were in 10 pi and contained 1.0 pi 
10x buffer (Roche Molecular Biochemicals, Indianapolis, IN, USA), 0.25 mM 
of each dNTP (Boehringer Mannheim), 5 pmole of forward primer and reverse 
primer, and 0.25 unit of Taq DNA polymerase (Roche Molecular 
Biochemicals, Indianapolis, IN, USA). 

20 

Forward primer CTGTGTCAGCACCTGCACCTC SEQ ID NO. 10 
Reverse primer ATCCCCATGGGCACGATTTC SEQ ID NO. 11 

The amplified fragment lies between exons 4 and 6, and has a product size of 
25 230 bp. 

DNA Engine Tetrad cyclers (MJ Research, Inc., Waltham, MA, USA) 
were used for PCR amplification. The conditions for PCR were a 3-min 
denaturation step at 94°C, followed by 35 cycles at 94°C for 30 sec, 50°C for 
30 30 sec, and 72°C for 1 min. A final extension step of 7 min at 72°C was 
performed and samples were maintained at 10°C. PCR products were 
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analyzed by elecffophoresis on 2% agarose gels in 1x THE. A 100 bp ladder 
(Gibco-BRL, Rockville, MD, USA) was used as size standard. 



Table 3 



Human Tissue (Origene) 


G95 Tissue Expression Levels 


Tissue 


by PCR 


Muscle 


H ! 


Heart 


H 


Placenta 


H 


Plasma Blood Leucocytes 


H 


Brain 


M 


Lung 


M 


Kidney 


L 


Liver 


L 


Spleen 


L 


Small Intestine 


L 


Adrenal Gland 


H 


Fetal Brain 


H 


Fetal Liver 


H 


Ovary 


H 


Prostate 


H 


Stomach 


H 


Testis 


H 


Thyroid 


H 


Pancreas 


M 


Salivary 


M 


Uterus 


M 


Bone Marrow 


L 


Skin 


L 


Colon 


no expression 



5 



Relative levels of G95 cDNAs are set forth in Table XX. H=high, 
M= medium, L=low. 



10 
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Example 3 
Tissue Expression by Northern Blot 



Quantitation and sizing of the Gene 95 transcript were analyzed by use 
5 of a human 12-Lane multiple tissue Northern blot (BD Biosciences Clontech, 
Palo Alto, CA, USA). The human blot was hybridized with a 401 bp probe for 
Gene 95 from exons 6 to 9. A 4.4 kb band was identified, as expected. Actin 
was used as the control. 

Forward primer 5' TTTCCAGAATTGCGTGTTGGTG 3* SEQ ID NO. 12 
1 0 Reverse primer 5' TGGATGCCACCCTTGAGCTG 3' SEQ ID NO. 1 3 



Results (Figure 9) show that Gene 95 is ubiquitously expressed with 
highest levels in muscle and heart. These results were confirmed by Northern 
blot analysis of mouse tissues (data not shown). In one such run, using 
15 fibroblasts, approximately 24 hours prior to cholesterol treatment, 2 x 10 5 cells 
were plated into each well in a 6-well format so as to obtain 50-70% 
confluency the following day. Cells were treated with a final concentration of 
30 jig/ml cholesterol for 24 h followed by total RNA preparation. The control 
was Gene 95 expression from unaffected cell lines. 

20 

Example 4 
Gene 95 expression in patient fibroblasts 

25 

The Ser to Ala mutation within exon 10 of Gene 95 is present in four 
patient fibroblasts (400, 558, 549, and 575). We decided to compare the 
expression of Gene 95 in these fibroblasts to normal controls (226, 475, and 
485). This analysis was performed in the absence and presence of 
30 exogenous cholesterol, in order to explore a connection between lipid 
metabolism and Gene 95. Quantitative Real Time PGR (or TaqMan n ) assays 
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were performecras follows. Human Gene 95 primers and their probes were 
designed using Primer Express software (Applied Biosystems, Foster City, 
CA). 

5 Forward primer: 5' CCAAGGGAGTGTGCAAGG -3' SEQ ID NO. 31 

Reverse primer: 5' CCTTrGTAAAAGCCATCAGGAAACT-S' SEQ ID NO. 32 



RT-PCR was carried out in an ABI 7900 sequence detection system 
0 in a final volume of 25 ul, containing 120 ng of total RNA for human Gene 95 
or 50 ng for human apoA1, 200|aM primers, and 600 jaM probe in 1x Taqman 
one-step RT-PCR Master mix (Applied Biosystem, CA). GAPDH or 18s 
(human) (Applied Biosystems) was used as the internal control. Data 
quantification and analysis was performed according to manufacturer's 
5 protocol. Briefly, AC T for the calibrator and samples was obtained by 
subtracting the VIC C T value of 18s RNA or GAPDH from the FAM C T value of 
the gene of interest. AAC T was calculated by subtracting the average AC T 
(calibrator) values from AC T (sample). The mRNA quantity for the calibrator is 
expressed as 1 x sample and all other quantities are expressed as a number 
20 of percentage changes relative to the calibrator. Each sample was assayed in 
triplicate and standard error (SE) was calculated. RT-PCR reactions without 
RNA, and reactions with RNA but without reverse transciptase were used as 
negative controls. The size of the PCR product was confirmed by agarose gel. 

25 Overall, extremely low levels of expression of Gene 95 were found in 

all fibroblasts, including controls. TaqMan results indicated that Gene 95 
expression in two patient cell lines (400 and 575) was down regulated by 
approximately 50% compared to the controls either with or without cholesterol 
treatment. Expression of Gene 95 in patient 558 did not vary significantly from 

30 the controls. However, the expression of Gene 95 in 549 was elevated above 
controls (data not shown). 
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A trendxowards reduced Gene 95 expression UPPnutation carriers was 
noted. Messenger RNA expression levels may be sensitive to cell culture 
conditions, cell density and passage number. Without desiring to be bound to 
any theory of mechanism of action, it is suggested that this trend towards 
5 decreased levels of Gene 95 mRNA in mutation carriers may be the 
underlying mechanism of the low HDL condition in these patients. 
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1. A method for identifying an agent that modulates Gene95 activity, 
comprising: 

5 a) contacting a test compound with a genetic construct comprising a 

reporter gene operably linked to a Gehe95 promoter under conditions 
supporting transcription of said reporter gene; 

b) determining a change in transcription of the reporter gene as a result 
of said contacting 

10 wherein a change in said transcription indicates that the test compound 

is an agent that modulates Gene95 activity. 

2. The method of claim 1 wherein the determined change in 
transcription of step (b) is a decrease in transcription. 

15 

3. The method of claim 1 wherein the determined change in 
transcription of step (b) is an increase in transcription. 

4. The method of claim 1 wherein transcription is determined by 
20 measuring the amount of an expression product encoded by said reporter 

gene. 

5. The method of claim 4 wherein the expression product is an RNA. 

25 6. The method of claim 4 wherein the expression product is a 

polypeptide. 

7. The method of claim 1 wherein the reporter gene is in a liposome. 
30 8. The method of claim 1 wherein the reporter gene is in an intact cell. 

9. The method of claim 8 wherein the intact cell is a mammalian cell. 
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10. method of claim 1 wherein the pr^fcter is a mammalian 
Gene95 promoter. 

11. The method of claim 10 wherein the Gene95 promoter is a human 
5 promoter. 

12. The method of claim 11 wherein the human promoter comprises 
the promoter sequence in SEQ ID NO: 15. 

10 13. The method of claim 10 wherein the Gene95 promoter is a mouse 

promoter. 



14. The method of claim 13 wherein mouse promoter comprises the 
promoter sequence in SEQ ID NO: 14. 

15 

15. The method of claim 1 wherein the reporter gene is not Gene95. 



16. A method for identifying an agent that modulates a Gene95 activity, 
comprising: 

20 a) contacting a test compound with a polypeptide encoded by a 

polynucleotide corresponding to gene95 under conditions supporting an 
activity of said polypeptide; and 

b) determining a change in the activity of the polypeptide as a result of 
said contacting; 

25 wherein said change in activity identifies the test compound as an 

agent that modulates a Gene95 activity. 



17. The method of claim 16 wherein the determined change in activity 
in step (b) is a decrease in activity. 

30 

18. The method of claim 16 wherein the determined change in activity 
in step (b) is an increase in activity. 
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19. The^fethod of claim 16 wherein the actr^^ is measured by 
measuring the.activity of an enzyme. 



20. The method of claim 16 wherein the polypeptide is present in a lipid 
5 bilayer. 

21. The method of claim 20 wherein the lipid bilayer is part of a 
liposome. 

10 22. The method of claim 16 wherein the polypeptide is part of an intact 

cell. 

23. The method of claim 22 wherein the intact cell is a cell that has 
been engineered to comprise said polypeptide. 

15 

24. The method of claim 22 wherein the intact cell is a recombinant cell 
that has been genetically engineered to express said polypeptide. 

25. The process of claim 24 wherein the cell does not express said 
20 polypeptide absent said engineering. 

26. The process of claim 22 wherein said cell is a mammalian cell. 

27. A method for identifying an HDL-enhancing agent, comprising 
25 administering to an animal an effective amount of an agent found to have 

modulating activity using an assay of claim 1, 13, 24 or 49 and detecting an 
increase in plasma HDL activity in said animal due to said administering 
thereby identifying an agent useful in enhancing HDL activity. 

30 28. The method of claim 27 wherein the animal exhibits low HDL 

activity prior to administering said agent. 
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29. The method of claim 27 wherein the HDL-e«l5ncing activity is an 
increase in HDL level in said animal. 

30. The method of claim 29 wherein the increase in HDL level is an 
5 increase in plasma HDL level. 

31 . The method of claim 29 wherein the animal is a human patient. 

32. A method for treating a low-HDL related disorder in an animal 
10 afflicted with said disorder comprising administering to said animal an 

effective amount of an agent found to have HDL-enhancing activity using the 
assay of claim 27. 



15 



33. The method of claim 32 wherein the animal is a human patient. 

34. The method of claim 32 wherein the disorder is selected from the 
group consisting of low HDL diseases, vascular diseases and dyslipidemias. 

35. A method for producing test data with respect to the modulating 
20 activity of a test compound comprising: 

(a) contacting a test compound with a Gene95 polynucleotide or 
Gene95 polypeptide, or a polynucleotide construct comprising a reporter gene 
operably linked to a Gene95 promoter, under conditions wherein said 
polynucleotide or reporter gene is being transcribed or said polypeptide is 

25 active, 

(b) determining a change in the activity of said polypeptide or 
transcription of said polynucleotide or reporter gene as a result of said 
contacting, and 

(c) producing test data with respect to the modulating activity of said 
30 test compound based on a change in the transcription of the determined 

polynucleotide or reporter gene or activity of the determined polypeptide 
wherein said change shows modulating activity. 
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36. A method for treatment or prophylaxis of cardiovascular disease 
comprising administering to a patient in need thereof, a therapeutic or 
prophylactic dose of a compound which modulates gene95 protein biological 
activity or expression. 



37. A method for identifying a therapeutic agent, comprising: 

a) contacting a chemical agent with a polynucleotide corresponding to 
gene95 under conditions supporting expression of said polynucleotide; 

b) determining a change in the expression of said polynucleotide as a 
1 0 result of said contacting; 

wherein a change in said expression identifies a therapeutic agent. 

38. A method for identifying a therapeutic agent that modulates the 
activity of a polypeptide that affects high density lipoprotein (HDL)-activity In 

15 vivo, comprising: 

a) contacting a test compound with a polypeptide encoded by a 
polynucleotide corresponding to gene95 under conditions supporting an 
activity of said polypeptide; and 



20 of said contacting; 

wherein a change in said activity identifies the test compound as a 
therapeutic agent that modulates the activity of a polypeptide that affects HDL 
activity. 



5 



b) determining a change in the activity of said polypeptide as a result 



25 



39. The method of claim 38 wherein said polypeptide corresponds to 



SEQ ID NO: 4. 



40. The method of claim 38 wherein said polynucleotide corresponds to 



SEQ ID NO: 3 



30 
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41. Ai^^^lated polynucleotide comprising a p^f^ucleotide sequence 
or the full complement of the polynucleotide sequence, wherein the 
polynucleotide sequence is at least 95% identical to SEQ ID NO: 3. 

5 42. An isolated polynucleotide comprising a polynucleotide sequence 

that encodes a polypeptide having the amino acid sequence set forth in SEQ 
ID NO: 4. 

43. An isolated polynucleotide comprising a polynucleotide that has 
0 the sequence set forth in SEQ ID NO: 3. 

44. An isolated polypeptide comprising an amino acid sequence having 
at least 95% identity with the amino acid sequence set forth in SEQ ID NO: 4. 

5 45. The isolated polypeptide of claim 44, wherein the isolated 

polypeptide comprises an the amino acid sequence having at least 95% 
identity with the amino acid sequence set forth in SEQ ID NO: 4. 

46. An isolated polypeptide comprising the amino acid sequence set 
20 forth in SEQ ID NO: 4. 

47. An isolated polypeptide consisting of the amino acid sequence set 
forth in SEQ ID NO: 4. 

25 48. A nucleic acid vector comprising the isolated polynucleotide of 

claim 41. 

49. A recombinant host cell comprising the vector of claim 48. 

30 50. A method for producing the polypeptide of SEQ ID NO: 4 

comprising culturing the host cell of claim 49 under conditions supporting 
production of the polypeptide. 
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Figure 5 A 



aaggcgcgcggatgggccggcacgcaggcgcactagctcgccacggccccggaagcgcag 
gagaggccgccgggtggggctaggcgctgcgacagccggcgtgaggaagctcagtgggct 
acgaacgtctggcacacatgcaaccgccccctcgggctgcctccgcctgccggctacttc 
tttctcccgccttccgctc 

El : ATGTCCTTCCGGGGCCACGCAGAAAGTGCCGCCGCTTTGGCCACTCAGAGC 
+2: CPSGATQKVPPLWPLRA 
El : CCCCGGGCCGCGGTCGTCGTACGCCTGAAGGCGGGTCGTGCCGGCGGCCGC 
+2: PGPRSSYA*RRVVPAAA 
El : TCTAGTCTCCGCCTCCGCTCAGGCCGGTCCTCCGGGGCTTCTCAATGGTTT 
+2: L V S A SAQAGPPGLLNGF 
El : CCCGGTGGCCTCTCAATGGTTTTCCCGGCGGCCCTTGCGCCGACGCCAGGA 
+2: PVAS QWF SRRPLR RRQE 
El : GACTTCCGGAGCTTGGTGACGTCACGAGCGAGCTTTTCTACCCAAATACGC 
+2: T SGAW*RHERAFLPKYA 
El : GGCGGGGGAATAGGCTCGAGGGCGGTGAGCAGTGACAATTGCTAGGCGGAG 
+2: AGE * ARGR*AVT IARRR 
El : ACAGT GC AGGGAAGAGAGACCT T AGAAAGGAT CAGG ACTGGCGG 
+2:QCREERP*KGSGLA G 



gtatgt. . . cttatttatattctag 

E2 : GAGGTATTTAACTGAAAGGAATATCTGCTTCACTGTTGCAACCAAACCAGA 
+3: GI*LKGISASLLQPNQM 
E2 : TGCCTTCTTCCACTTCACCAGACCAAGGAGATGACCTGGAGAACTGCATTT 
+3: PSSTSPDQGDDLENCIL 
E2 : TAAGATTTTCTGACCTGGATTTAAAAGATATGAGTCTTATTAATCCCAGCA 
+3: RFSDLDLKDMSLINPSS 
E2 : GCAGTCTTAAAGCAGAATTAGATGGCAGTACAAAAAAGAAATACTCGTTTGO 
+3: SLKAELDGSTKKKYSFA 
E2: CAAAGAAAAAG 
+3: K K K 

gtagaa. . . 
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Figure 5B 



10/518868 



. . . tgaaatctaattgcag 



E3 : GCCTTTGCCCTTTTTGTCAAAACCAAAGAAGTTCCAACAAAAAGGAGTTTT 
+ 1: A F A L FVKT KEV PT KRS F 
E3 : GAATGTAAAGAAAAATTGTGGAAATGCTGTCGGCAGCTATTCACAGACCAA 
+1: ECKEKLWKCCRQLFTDQ 
E3 : ACCAGCATCCATAGACATGTGGCAACACAACATGCTGATGAAATTTATCAC 
+1: T S I HRHVATQHADE IYH 
E3 : CAGACAGCTTCTATTTTAAAGCAACTGGCTGTGACATTGAGCACCTCAAAG 
+ 1: QTASILKQLAVTLSTSK 
E3 : AGTCTTTCGTCTGCAGATGAAAAGAACCCTTTAAAAGAGTGCCTTCCACAT 
+1: S L S S A D E K N P L K E C L P H 
E3 : AGCCATGACGTGTCTGCTTGGCTCCCTGATATAAGCTGCTTTAACCCTGAT 
+1:SHDVSAWLPDI SCFNPD 
E3: GAGCTGATAAG 
+1: E L I S 



E4 : TGGCCAGGGCAGTGAAGAAGGGGAGGTGCTCCTTTATTACTGCTACCATGA 
+2: GQGSEEGEVLLYYCYHD 

E4 : CCTGGAGGATCCCCAATGGATCTGTGCCTGGCAGACAGCTCTGTGTCAGCA 
+2: LEDPQWICAWQTALCQH 

E4: CCTGCACCTCACAGGCAAG 
+2: L H L T G K 



E5 : ATTCGAATTGCTGCAGAAGGAATCAATGGGACAGTTGGTGGAAGCAAATTG 
+1:1 R I A A E G I N G T V G G.S K L 
E5: GCTACCAGACTTTATGTGGAAGTCATGCTTTCCTTCCCATTGTTTAAGGAT 
+1: ATRLYVEVMLS FPL FKD 
E5 : GACCTGTGTAAAGATGATTTTAAG 
+1: DLCKDDFK 



gtaaga. . . gattttcattttatag 



gtaaca. . . ccgtcttgtgtctcag 



gtaaga . 
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Figure 5C 



10/51886& 



. . . gtttctcattggctag 



E 6 : ACCAGCAAAGGAGGAGCTCACTGTTTTCCAGAATTGCGTGTTGGTGTATTT 
+ 1:T SKGGAHCFPELRVGVF 
E 6 : GAAGAAATCGTGCCCATGGGGATCAGCCCCAAAAAGATCTCCTACAAGAAG 
+1: EEIVPMGISPKKISYKK 
E6: CCTG 
+ 1: P G 



E7 : GAATCCATTTATCCCCAGGTGAATTTCATAAAGAAGTAGAAAAGTTTTTAT 

+3: IHLSPGEFHKEVEKFLS 

E7 : CTCAGGCAAATCAAGAACAAAGTGATACTATCCTTCTTGATTGCAGAAACT 

+3: QANQEQSDT ILLDCRNF 

E7 : TCTATGAAAGCAAAATA 

+3: Y E S K I 



E8 : GGACGATTCCAAGGCTGCTTAGCCCCAGACATCAGGAAATTCAGTTACTTC 
+1: GRFQGCLAPDI RKFSYF 
E8 : CCTAGCTACGTTGACAAAAATCTAGAACTTTTCAGAGAGAAGAGAGTGCTG 
+1:P SYVDKNLELFREKRVL 
E8 : ATGTACTGTACCGGGGGCATCCGCTGTGAGCGGGGTTCAGCCTACCTCAAA 
+1: MYCTGG-IRCERG SAYLK 
E8: GCCAAG 
+1: A K 



E9 : GGAGTGTGCAAGGAGGTGTTCCAGCTCAAGGGTGGCATCCACAAGTACCTG 
+1: GVCKEVFQL. KGGIHKYL 
E9 : GAAGAGTTTCCTGATGGCTTTTACAAAGGGAAGTTGTTTGTTTTTGATGAA 
+1:E EF P DG FY KGKL FV FDE 
E9 : CGCTATGCTCTGTCCTACAACAGTGATGTGGTGTCAG 
+1:RY ALSYNSDVVS E 



gtatgc. . . tttggtttggttttag 



gtaagt. . . tgctcctatgttacag 



gtgagc. . . gtttttccacacctag 



gtaggt . 
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E10 
+3 
E10 
+3 
E10 
+3 
E10 
+3 
E10 
+ 3 
E10 
+3 
E10 
+ 3 
E10 
+ 3 
E10 
+3 
E10 
+ 3 
E10 
+3 
E10 
+3 
E10 
+ 3 
E10 
+3 
E10 
+ 3 
E10 
+3 
E10 
+3 
E10 
+3 
E10 
+3 
E10 
+3 
E10 
+3 
E10 
+3 



. . . tttccttcctccccag 

AGTGTTCATACTGTGGAGCCCGCTGGGACCAGTATAAACTCTGCTCTACTC 
CSYCGARWDQYKLCSTP 
CCCAGTGCCGCCAGCTCGTTTTGACCTGCCCTGCCTGTCAAGGACAAGGAT 
QCRQLVLTCPACQGQGF 
TCACAGCCTGTTGTGTCACATGTCAAGACAAGGGGAGCAGGAAAGTTTCAG 
TACCVTCQDKGSRKVSG 
GCCCTATGCAAGACAGCTTTAAAGAGGAATGCGAGTGCACAGCCCGACGGC 
PMQDSFKEECECTARRP 
CACGCATACCTAGGGAACTCTTGCAGCATGTGCGACAGCCTGTGAGCCCAG 
RI PRELLQHVRQPVS PE 
AGCCAGGGCCTGATGCTGATGAGGATGGGCCAGTGCTTATGTGAGCAGCAC 
PG P DA DE DGP-VLM *AAP 
CTTTGGCATTTTCCCAGGCCCTCGGTAAAAG i TAGGTTTGGGGTGACTATAC 
LA F S Q A L GK S R FG V T I Q 
AGAGAAAGCATGGCAAGACTGCAGAAACAGAGAAATCGGGAACTTCAGTTC 
RKHGKTAETEKSGTSVL 
TGGCCGCTGCCACCGTGGCAGCCGTCTACACTTCACAGCGGGAGGGGAGGA 
AAATVAAVYT SQREGRS 
GTCACGTTGTCTACCACTTACCTGAGACATTCTGATTTGGATGATGCTAGA 
HVVYHLPETF*FG*C*S 
GCACAGAAAATAGGTGAGCTGCATGGGATCCCAAAGCTGCTGAGGGATAGA 
TENR*AAWDPKAAEG*S 
GCCTGAGCCTGGTGGCCACAGCATATGCCCTTTCTGTTCCATGCAGCTGGG 
LSLVATAYA LSVPCSWG 
GCTGTTAGTAGTCATTGCCCTTGTCAGCAGACCTTCTACCCTGGTGGCAAA 
C * * SLPL SADLLPWWQT 
CACATGAAAGCTGTGGCCCTGGGAGTGGCCTCCTAAAACAAGCCACTTAGG 
HESCGPGSGLLKQAT*V 
TCATCTGCCATCTACCCTTAACCTCTGTCTCTCGCCTGAGGGGAATCTGCA 
ICHLPLTSVSRLRGICK 
AGCTGTGCATTGGGCTTACCTCCTGCTTTTGTAGAAATAACCATCCTTTGG 
LCIGLTSCFCRNNHPLV 
TATACATGGAGGATAGTTCCAGAACGCCTGAGTATACAAAAACCCAATGCA 
YMEDSSRTPEYTKTQCI 
TACTCAAGTCCCACAGTGGGCCCTACAGAACCCACGTATGTGATAAATCAG 
LKSH SGPYRTHVCDKSA 
CCCTCCATGTACGCAGGTTTCGCCCCCTGCCAATACTGTATTTTCAACCTG 
LHVRRFRPLPILYFQPV 
TATGGTTGAAAAAAATCCATATATAAGTGCAGCCATGCAGTTCAAACCCAT 
WLKKIHI *VQPCSSNPY 
ATTGTTCAAGGGTCAACTGTATAGTTTATTGAACAGCCACACCCATTCCTT 
CSRVNCIVY*TATPIPL 
TACACATGATCTATGGCAGAGTTGAATAGTTGCAACAGACACTATGTGGCC 
HMIYGRVE* LQQTLCGL 
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Figure 5E 



* (C) 



86 



E10 : TGCAAAATCGGAAATTTTTACTGTCTGGCCTTTTACAGAAAAGTTTGCCAG 
+ 3: QNRKFLLSGLLQKSLPA 
E10 : CCCCTGATCTAGACCAGCAGCTCATCTGATAGAGGCAGAGGTGGCCTTAAA 
+ 3: PDLDQQLI**RQRWP*R 
E10 : GATGTGGCCTTCTTCATTTTCTGTTGGTTTGGTTTCGTTTCTATGAGAGAT 
+ 3: CGLLHFLLVWFRFYERF 
E10 : TTCCTCTGATAGCTCTGCTTTCCCCAGCACTTACTCTCTGAGCTTTTAAAT 
+3: PLIALLSPALTL*AFKC 
E10 : GTTCTCTCTGGGAGCTTCATATAAGCTCGGTGACATTTGAGCCACAGTTTT 
+3: SLWELHISSVTFEPQFL 
E10 : TAGATCAGCACCTGGAATACATGACACATTCTTACTGAGGTCATCCAGCAC 
+3: DQHLEYMTHSY*GHPAL 
E10 : TGCCATGGTGGCTGCCCAGTCTTCTGGCCAGTGTGCCAGGCACATGTCCCT 
+3: PWWLPSLLASVPGTCPC 
E10 : GTCACACAGGTTCCAAGAAACACATACGCAGCCATGCATAGACCAACAGAT 
+3: HTGSKKHIRSHA*TNRF 
E10 : TTAATATTATATTGCAGTTTTCAGCGATGCAGAATGCAGCTGCAATTGTGT 
+3: NIILQFSAMQNAAAIVF 
E10 : TTTAAGGAGAAGCCAAATGGGGATGGTTGTCCCTGCAACATGGTGCCACTC 
+3: *GEAKWGWLSLQHGATP 
E10 : CTGGGCCATGTGCAGCCTCAGTGGACACTCTTCCATAGCGCTGAGGCCCTG 
+ 3: GPCAASVDTLP*R*GPG 
E10 : GCCCCGCCTCCAGTTACCCTGTACTGCCCACTGCCTTACAGTTCAGTGCGC 
+3: PASSYPVLPTALQFSAQ 
E10 : AGGCCTTCACCTTTTCATCACCAGCCTCTCTGCTCAGTGCTCTGGAGTTCT 
+3: A F T F S S V P — A SLLSALE FL 
E10 : TGACCTTGTCCTTTATCATGAGATTTGCTGAAATCACTAATGAAAATAACT 
+3: TLSFIMRFAEITNENNS 
E10 : CCCAAAAGCAACAAACAAAAATATTAGTTTAACTGGCACTGTGGTATATTA 
+3: QKQQTKILV*LALWYIK 
E10 : AAAGGCACAAGGGCATTGTGGCTTAACACTTTTGCTGGATCCCAAGAGACG 
+3: RHKGIVA*HFCWI PRDA 
E10 • CACATGATGTTAAAAAGAGATCTGGCAGCAGTACTAATACTACATTTCAGT 
+3: HDVKKRSGSSTNTTF QC 
E10 : GTAATCATCTTGGGGTGGTTTGGCCAGGATTTCCCAATTCCTTGATATCTG 
+ 3: NHLGVVWPGFPNSLISG 
E10 : GAGTTTCTTCACCATTGTCCGGCATCCTGCGGAGGCTTAATATACAGGCGT 
+ 3: VSSPLSGILRRLNIQA* 
E10 : AAGGTCAGCAGCAATTTGTCTAATAAGTGATGAGATCAGTAGCTGAAGTCT 
+3: GQQQFV* *VMRSVAEVS 
E10 : CTAAGCTGGGCCATTACTAAATACCATAGCCATGTTGATCTGGAAATTTAT 
+3: K LGHY*IP*PC*SGNLS 
E10 • CCCTCTAGTGTCTTACCTCACATAAGCCATTTGCCCACTGTGCAATATAGA 
+3 " : L*CLTSHKPFAHCAI*K 
E10 : AAGGTGTTTTCAAAAGTATTTGGCCGTAGATTTTCACATCCATCATAAGGT 
+3: GVFKSIWP*IFTSIIRL 
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Figure 5F 



10/518 868 



E10 : TGGCATTCAATAAGGAAAAAGTTCTAACTCCAGTATTAAATTGTACATAAA 

+ 3: AFNKEKVLTPVLNCT*I 
E10 : TCCCAAATGTTCTTAAAGAACACTCAGGGACATGTTTGTTGCCTGGGATTG 

+ 3: PNVLKEHSGTCLLPGIG 
E10 : GTAATGAAAGGTTGGTTTTTGAAACTTGAAATTTCACCATTGGTTTTTTTC 

+ 3: NERLVFET*NFT IGFFP 
E10 : CTATCATTTCTGCATATCCAGCAAAAGGAATCTCATGTTGACTCCTGGCAG 

+3: IISAYPAKGISC*LLAE 
E10 : AGTTCAGTGGCTTCAGTCTGTCTATCTGTTCTGAGGGGAAAATTGTGTTCT 

+3: FSGFSLSICSEGKIVFW 
El 0 : GGATCCAGTAATCAATTTGGCAACTTTAATCGAGGTTTTCAAAATTCCAAG 

+3: I Q * S IWQL*SRFSKFQG 
E10 : GAGGGTT AAT AAAGAAT GAT AAT CAGTTT TATTT GCT AAT AGCT AAG AC AA 

+3: GLIKNDNQFYLL IAKTN 
E10: ATTTGTAATAAAGTGTTTTATAATACTTC 

+ 3: L * * • S ' V L, * Y F 



E6a : GTTACGAGACAGTACAATAGAAGGAGTATGCTCGTCCCCATTCTTTCACTG 

+1: VTRQYNRRSMLVPILSL 
E6a : AGTCACCATATGATTTTGGACCAGCTAGTGCTCTAGACCTCAGTATCCCTT 

+ 1: S H H M I L D Q L V L * T S V S L 
E6a : CTTATAAAATAAGAATGTTACAGCTCATGCAATCTGGGACTCCAAATCTTG 

+1: LIK* ECYSSCNLGLQIL 
E6a : GACATATTAGCTCACTTGAGAGACCACCAGCCTGGTCAGCAGATCACTGTG 

+1: D I LAHL RDHQP GQQ I TV 
E6a : TTTTTAGTAAATCTGGAATTGTAAGATTAACACTTCATACCACATGGGGGA 

+1: FLVNLEL*D*HFIPHGG 
E6a : ATAAAGTTGTTGCTCTCACAGGT 

+1: I K L L L S Q 



. . . gtgcttttctctttag 



gggctg. 
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Figure 5G 



. . .gtttttccacacctag 

E9a : GGAGTGTGCAAGGAGGTGTTCCAGCTCAAGGGTGGCATCCACAAGTACCTG 

+1: G V C K E V FQLKGG I HKYL 
E9a : GAAGAGTTTCCTGATGGCTTTTACAAAGGGAAGTTGTTTGTTTTTGATGAA 

+ 1: EEFPDG FYKG KLFVFDE 
E9a : CGCTATGCTCTGTCCTACAACAGTGATGTGGTGTCAGGTAGGTCAGCACAG 

+1: RYALSYNSDVVSGRSAQ 
E9a : GCTCAGAGCCCAAACTGAAATGAAGCACATTGTCAGTTCACTATTCTAGAA 

+1: AQS PN * N EAHCQFT I L E - 
E9a : AAATGACACAGGGAAGACAGGCCAGTGCTCATTACTGAGCACTGAATAAGC 

+1:K*HREDRPVLITEH* IS 
E9a : AGGGAAAATAAGTACATTGTGCCACCATTTTCCCAGCTGTGGAGCTGAGAG 

+1:R E N K Y I V P P F S.Q L W S * E 
E9a : AACCCTAGCCCAGGAG.TCAGGAGGCCTGGGTTGGGATCCTGGCTTCACCAT 

+1:NPSPGVRRPGLGSWLHH 
E9a : TGCTAGCTGGACAAGCCCATTAACATGGGGATCATCTCACCTGCCCTGCCT 

+1: C * L DKP I NMG I I S PAL P 
E9a : GCCTGTCTACCTGCCAAGAGCTGTACTACTGGGCTAATTCAGGGCTCTTAA 

+1 : A C L P A K S C T T G L I Q G S * 
E9a : CCTGGAATTGGTACATAGATTTCAGGGATTCTGTGAATTTGGATGGAAAAA 

+1:PGIGT*ISGIL*IWMEK 
E9a : TAATTGTATCTTTGTTTTCAATAACACCTCACTAAAATGAAGCATTTCCTT 

+1: * LYLC FQ * HLT KMKH FL 
E9a : TAGTTATGAATGTAGGCAACAAAGTACCAGTTGTATTAATGTACCTGTGAC 

+1:*L*M*ATKYQLY*CTCD 
E 9a : TTTGTCTTCAGTAGGATTCACAATACTTTCATATCATGTTCTAGTTGCCTC 

+1:FVFSRIHNTFISCSSCL 
E9a : AGAT AT CT C AAAAT AGT AT TTATACTCAT CACTGCTTCAAAAT GAAAATAG 

+1:RYLKIVFILITASK*K* 
E9a : TTATTAGGCCCACCACTAAGAGTTGATATATAATGTGTTAATAAATGGCAC 

+1: L L G PPLRVDI * CVN KWH 
E9a : GTCTTATTATATATTACAGATTTTGAAAAAGA 
+1: VLLYITDFEK 
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Figure 6 



human 
mouse 



1 ^PSSTSPb jQC 

1 |mpsstspd| ei 



KFS jDLDLKj PMSniplPS 
^F lDLDLK|] SSNH VpPS 



slkaeldgstkkkysfakkkafalIf 



psIn!slkaeldgstkkkysfakkkafal[l 



human 
mouse 




Dq1t|S I HRHVAT QHft^ fc j^ jQTASljgKQI^ VTpSl 
D^llS! HRHVATQHA[g^^^^ 



human 121 T| 
mouse 120 



; SA@E-SNPL^^gPH§H3^^^^p 
|PT0KgS S P^@T PpQ E|VSAWLPDp 



]i |sg^ gS| 



SVLLYYCYHDLE 



LVLLYYCYCDLE 



human 181 |DI 
mouse 180 



\WQTALC1q1hIjHLTGKIRIA^ 



FPLFK 



human 241 
mouse 240 



a [hcfpelrvgvfeeivpmgispi kki |sykkpgihlspgefhke^ k|f ^sq| anq 



human 301 [^SDlTlll Jl^ 

mouse 300 ^V^ ^^^^^^^^^^^^^^^^^W & ^^^^^^ E 



human 361 jCERGSAYI 
mouse 360 |CERGSAYI 



<gvckevfqlkggihkyleefpdgfykgklfvfde: 
sgvckevfqlkggihkyleefpdgfykgklfvfde 




human 421 pTCGi 



Jr|wdqyklcs t pqcrqlvlt c 


PlACQGQGFTACCVTCQDKq. 


Jp|WDQYKLCSTPQCRQLVLTC 


SlACQGQGFTACCVTCQDKGp 



^sgp Nqps fkeecei 
^dsfkeece | 



human 481 
mouse 480 



CTARRPRI 



CTARRjHES 




Legend: 

Identical amino acids: 



Similar amino acids: jSi? 



La m 
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P1G95-1 LKGISASLLQPNQMPSSTSPDQGDDLENCILRFSDLDLKDMSLINPSSSLKAELDGSTKK 
P1G95-2 LKGISASLLQPNQMPSSTSPDQGDDLENCILRFSDLDLKDMSLINPSSSLKAELDGSTKK 
PlG95-n LKGISASLLQPNQMPSSTSPDQGDDLENCILRFSDLDLKDMSLINPSSSLKAELDGSTKK 
************************************************************ 

P1G95-1 K Y S FAKKKAFAL FVKTKE VPTKRS FECKEKLWKCCRQL FT DQT S I HRHVAT QHADE I YHQ 
PI G9 5-2 KYS FAKKKAFAL FVKTKE VPTKRS FECKEKLWKCCRQL FT DQT S I HRHVAT QHADE I YHQ 
PlG95-n KYS FAKKKAFAL FVKTKEVPTKRSFECKEKLWKCCRQLFTDQTSIHRHVATQHADEI YHQ 
************************************************************ 

P1G95-1 TASILKQLAVTLSTSKSLSSADEKNPLKECLPHSHDVSAWLPDISCFNPDELISGQGSEE 
P1G95-2 TASILKQLAVTLSTSKSLSSADEKNPLKECLPHSHDVSAWLPDISCFNPDELISGQGSEE 
PlG95-n TASILKQLAVTLSTSKSLSSADEKNPLKECLPHSHDVSAWLPDISCFNPDELISGQGSEE 
************************************************************ 

P1G95-1 GEVLLYYCYHDLEDPQWICAWQTALCQHLHLTGKIRIAAEGINGTVGGSKLATRLYVEVM 
P1G95-2 GEVLLYYCYHDLEDPQWICAWQTALCQHLHLTGKIRIAAEGINGTVGGSKLATRLYVEVM 
PlG95-n GEVLLYYCYHDLEDPQWICAWQTALCQHLHLTGKIRIAAEGINGTVGGSKLATRLYVEVM 
************************************************************ 

P1G95-1 LSFPLFKDDLCKDDFKTSKGGAHCFPELRVGVFEEIVPMGISPKKISYKKPGIHLSPGEF 
P1G95-2 LSFPLFKDDLCKDDFKTSKGGAHCFPELRVGVFEEIVPMGISPKKISYKKPGIHLSPGEF 
PlG95-n LSFPLFKDDLCKDDFKTSKGGAHCFPELRVGVFEEIVPMGISPKKISYKKPGIHLSPGEF 
************************************************************ 

P1G95-1 HKEVEKFLSQANQEQSDTILLDCRNFYESKIGRFQGCLAPDIRKFSYFPSYVDKNLELFR 
P1G95-2 HKEVEKFLSQANQEQSDTILLDCRNFYESKIGRFQGCLAPDIRKFSYFPSYVDKNLELFR 
PlG95-n HKEVEKFLSQANQEQSDTILLDCRNFYESKIGRFQGCLAPDIRKFSYFPSYVDKNLELFR 
************************************************************ 

P1G95-1 EKRVLMYCTGGIRCERGSAYLKAKGVCKEVFQLKGGIHKYLEEFPDGFYKGKLFVFDERY 

P1G95-2 EKRVLMYCTGGIRCERGSAYLKAKGVCKEVFQLKGGIHKYLEEFPDGFYKGKLFVFDERY 

PlG95-n EKRVLMYCTGGIRCERGSAYLKAKGVCKEVFQLKGGIHKYLEEFPDGFYKGKLFVFDERY 

• ************************ ************************************ 

P1G95-1 ALS YNSDWSECS YCGARWDQYKLCSTPQCRQLVLTCPACQGQGFTACCVTCQDKGSRKV 
P1G95-2 ALSYNSDWSECSYCGARWDQYKLCSTPPVPPARFDLPCLSRTRIHSLLCHMSRQGEQES 
PlG95-n ALS YNSDWSECS YCGARWDQYKLCSTPQCRQLVLTCPACQGQGFTACCVTCQDKGSRKV 
**************************** : *. . : : - :*.:: 

P1G95-1 AGPMQDSFKEECECTARRPRIPRELLQHVRQPVSPEPGPDADEDGPVLM 

P1G95-2 FRPYARQL 

P1G95-H SGPMQDS FKEECECT ARRPRI PRELLQHVRQPVS PE PGPDADE DGPVLM 

.1. 
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<120> Novel Therapeutic Target for Treating Vascular Diseases, 
Dyslipidemias and Related Disorders 

<130> 760050-100 



<140> 
<141> 

<150> 
<151> 



US/60/391,878 
2002-06-27 



<160> 32 

<170> Patentln version 3.0 



<210> 1 

<211> 501 

<212> DNA 

<213> Homo sapiens 



<400> 1 

agtagctctc 

cacacactgt 

tacattccac 

caggaccatc 

tgcgtgtgtg 

gctctgttgc 

caggttcaag 

ctcttggcta 

gagtgcagtg 



ctgataaaag 
gtgcctttat 
cccaaaataa 
ttcataaata 
tgtgtgtgtg 
ccaggctgga 
caattctcat 
attttttttt 
gcgcgatctt 



accaccaacc 
gtcctagaaa 
acatgggtca 
ttcatagctc 
tgtgtgtgtg 
gtgcagcagt 
gtctcagccg 
tttttttttg 

g 



atgggctagg 
gatcttttga 
tacatgcata 
ctcctataat 
tatagtttgt 
gcaatctcaa 
agtagctggg 
agacggagtc 



tctggccagt 
tatacaggac 
tttattcaat 
ctgttaaata 
ttgttttgag 
ctcactataa 
accacagtca 
tcgctctgtc 



ttacagaaag 
ctaaatggaa 
acacatatgt 
tgtgtgtgtg 
agggagtctt 
cttccacctc 
catgccacca 
acccaggcta 



60 
120 
180 
240 
300 
360 
420 
480 
501 



<210> 2 

<211> 600 

<212> DNA 

<213> Homo sapiens 



<400> 2 

gctgctgcct 

tacagtgtgt 

agccatgggc 

agccccgggt 

tgaagcaaga 

cacacacaca 

cctgttgtca 

cttttccaaa 

cctgggtagc 

ctcatttttc 



ggatgaactt 
ccttggctaa 
cttgcacttg 
tctcctctga 
gcacagtaag 
cacacacaca 
ttcagagagg 
acaggatgcg 
agcaggccac 
tgggccaggt 



caagtgttcc 
aggcagcctg 
aactagggtc 
gaaatgctgg 
acaaaagact 
cagagtagtg 
cgagtgcact 
taatggctgc 
cagagagtgt 
tcttgcctgc 



ttctcctgtt 
ggcacactgc 
tggccccagg 
ggtcacctgc 
acagagccga 
cattccagaa 
gggagccaca 
tgtggccact 
gcatccctcc 
ctgcaatcct 



ccctgctaca 
agtgccctgg 
actgtgacat 
tttgagggct 
cacacacaca 
caatatactc 
aaagtgcaat 
gctggcggtg 
ttctgtgctc 
cctctgatga 



tcttagattt 
tgtaacggac 
gctcaccctg 
gttcttagta 
cacacacaca 
atttcatttt 
gttgcaaaga 
tgtggggata 
tgcagtgggg 
cggagttacc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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<211> 4333 

<212> DNA 

<213> Homo sapiens 
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<400> 3 
atgtccttcc 
gcggtcgtcg 
tcaggccggt 
gcggcccttg 
tctacccaaa 
gcggagacag 
actgaaagga 
agaccaagga 
tatgagtctt 
gaaatactcg 
aacaaaaagg 
agaccaaacc 
gacagcttct 
tgcagatgaa 
gctccctgat 
aggggaggtg 
ctggcagaca 
aggaatcaat 
gctttccttc 
aggagctcac 
gatcagcccc 
tcataaagaa 
tcttgattgc 
agacatcagg 
agagaagaga 
cctcaaagcc 
cctggaagag 
tgctctgtcc 
ccagtataaa 
tcaaggacaa 
ttcaggccct 
catacctagg 
tgctgatgag 
cggtaaaagt 
gaaatcggga 
ggaggggagg 
gagcacagaa 
cctggtggcc 
gcccttgtca 
tggcctccta 
cctgagggga 
tcctttggta 
actcaagtcc 
acgcaggttt 
tatataagtg 
ttgaacagcc 
gacactatgt 
gccagcccct 
tggccttctt 
tctgctttcc 
aagctcggtg 
ttactgaggt 
cacatgtccc 
atttaatatt 
agaagccaaa 
ctcagtggac 
gcccactgcc 
cagtgctctg 
aaataactcc 
aaggcacaag 



ggggccacgc 
tacgcctgaa 
cctccggggc 
cgccgacgcc 
tacgcggcgg 
tgcagggaag 
atatctgctt 
gatgacctgg 
attaatccca 
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agaaagtgcc gccgctttgg ccactcaga^ cccccgggcc 60 

ggcgggtcgt gccggcggcc gctctagtct ccgcctccgc 120 

ttctcaatgg tttcccggtg gcctctcaat ggttttcccg 180 

aggagacttc cggagcttgg tgacgtcacg agcgagcttt 240 

gggaataggc tcgagggcgg tgagcagtga caattgctag 300 

agagacctta gaaaggatca ggactggcgg gaggtattta 360 

cactgttgca accaaaccag atgccttctt ccacttcacc 420 

agaactgcat tttaagattt tctgacctgg atttaaaaga 480 

gcagcagtct taaagcagaa ttagatggca gtacaaaaaa 540 

tttgcaaaga aaaaggcctt tgcccttttt gtcaaaacca aagaagttcc 600 

agttttgaat gtaaagaaaa attgtggaaa tgctgtcggc agctattcac 660 

agcatccata gacatgtggc aacacaacat gctgatgaaa tttatcacca 720 

attttaaagc aactggctgt gacattgagc acctcaaaga gtctttcgtc 780 

aagaaccctt taaaagagtg ccttccacat agccatgacg tgtctgcttg 840 

ataagctgct ttaaccctga tgagctgata agtggccagg gcagtgaaga 900 

ctcctttatt actgctacca tgacctggag gatccccaat ggatctgtgc 960 

gctctgtgtc agcacctgca cctcacaggc aagattcgaa ttgctgcaga 1020 

gggacagttg gtggaagcaa attggctacc agactttatg tggaagtcat 1080 

ccattgttta aggatgacct gtgtaaagat gattttaaga ccagcaaagg 114 0 

tgttttccag aattgcgtgt tggtgtattt gaagaaatcg tgcccatggg 1200 

aaaaagatct cctacaagaa gcctggaatc catttatccc caggtgaatt 12 60 

gtagaaaagt ttttatctca ggcaaatcaa gaacaaagtg atactatcct 1320 

agaaacttct atgaaagcaa aataggacga ttccaaggct gcttagcccc 1380 

aaattcagtt acttccctag ctacgttgac aaaaatctag aacttttcag 1440 

gtgctgatgt actgtaccgg gggcatccgc tgtgagcggg gttcagccta 1500 

aagggagtgt gcaaggaggt gttccagctc aagggtggca tccacaagta 1560 

tttcctgatg gcttttacaa agggaagttg tttgtttttg atgaacgcta 1620 

tacaacagtg atgtggtgtc agagtgttca tactgtggag cccgctggga 1680 

ctctgctcta ctccccagtg ccgccagctc gttttgacct gccctgcctg 1740 

ggattcacag cctgttgtgt cacatgtcaa gacaagggga gcaggaaagt 1800 

atgcaagaca gctttaaaga ggaatgcgag tgcacagccc gacggccacg 1860 

gaactcttgc agcatgtgcg acagcctgtg agcccagagc cagggcctga 1920 

gatgggccag tgcttatgtg agcagcacct ttggcatttt cccaggccct 1980 

aggtttgggg tgactataca gagaaagcat ggcaagactg cagaaacaga 2040 

acttcagttc tggccgctgc caccgtggca gccgtctaca cttcacagcg 2100 

agtcacgttg tctaccactt acctgagaca ttctgatttg gatgatgcta 2160 

aataggtgag ctgcatggga tcccaaagct gctgagggat agagcctgag 2220 

acagcatatg ccctttctgt tccatgcagc tggggctgtt agtagtcatt 2280 

gcagaccttc taccctggtg gcaaacacat gaaagctgtg gccctgggag 2340 

aaacaagcca cttaggtcat ctgccatcta cccttaacct ctgtctctcg 2400 

atctgcaagc tgtgcattgg gcttacctcc tgcttttgta gaaataacca 24 60 

tacatggagg atagttccag aacgcctgag tatacaaaaa cccaatgcat 2520 

cacagtgggc cctacagaac ccacgtatgt gataaatcag ccctccatgt 2580 

cgccccctgc caatactgta ttttcaacct gtatggttga aaaaaatcca 2640 

cagccatgca gttcaaaccc atattgttca agggtcaact gtatagttta 2700 

acacccattc ctttacacat gatctatggc agagttgaat agttgcaaca 2760 

ggcctgcaaa atcggaaatt tttactgtct ggccttttac agaaaagttt 2820 

gatctagacc agcagctcat ctgatagagg cagaggtggc cttaaagatg 2880 

cattttctgt tggtttggtt tcgtttctat gagagatttc ctctgatagc 2940 

ccagcactta ctctctgagc ttttaaatgt tctctctggg agcttcatat 3000 

acatttgagc cacagttttt agatcagcac ctggaataca tgacacattc 3060 

catccagcac tgccatggtg gctgcccagt cttctggcca gtgtgccagg 3120 

tgtcacacag gttccaagaa acacatacgc agccatgcat agaccaacag 3180 

atattgcagt tttcagcgat gcagaatgca gctgcaattg tgttttaagg 3240 

tggggatggt: tgtccctgca acatggtgcc actcctgggc catgtgcagc 3300 

actcttccat agcgctgagg ccctggcccc gcctccagtt accctgtact 3360 

ttacagttca gtgcgcaggc cttcaccttt tcatcaccag cctctctgct 3420 

gagttcttga ccttgtcctt tatcatgaga tttgctgaaa tcactaatga 3480 

caaaagcaac aaacaaaaat attagtttaa ctggcactgt ggtatattaa 354 0 

ggcattgtgg cttaacactt ttgctggatc ccaagagacg cacatgatgt 3600 
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taaaaagaga 
ttggccagga 
gcggaggctt 
agtagctgaa 
tttatccctc 
tgttttcaaa 
ggaaaaagtt 
ctcagggaca 
tttcaccatt 
actcctggca 
ctggatccag 
aataaagaat 
tttataatac 



tctggcagca 
tttcccaatt 
aatatacagg 
gtctctaagc 
tagtgtctta 
agtatttggc 
ctaactccag 
tgtttgttgc 
ggtttttttc 
gagttcagtg 
taatcaattt 
gataatcagt 
ttc 
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gtactaatac 
ccttgatatc 
cgtaaggtca 
tgggccatta 
cctcacataa 
cgtagatttt 
tattaaattg 
ctgggattgg 
ctatcatttc 
gcttcagtct 
ggcaacttta 
tttatttgct 



tacatttcag 
tggagtttct 
gcagcaattt 
ctaaatacca 
gccatttgcc 
cacatccatc 
tacataaatc 
taatgaaagg 
tgcatatcca 
gtctatctgt 
atcgaggttt 
aatagctaag 



:cTtc 



tgtaatcatc rtggggtggt 
tcaccattgt ccggcatcct 
gtctaataag tgatgagatc 
tagccatgtt gatctggaaa 
cactgtgcaa tatagaaagg 
ataaggttgg cattcaataa 



ccaaatgttc 
ttggtttttg 
gcaaaaggaa 
tctgagggga 
tcaaaattcc 
acaaatttgt 



ttaaagaaca 
aaacttgaaa 
tctcatgttg 
aaattgtgtt 
aaggagggtt 
aataaagtgt 



3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4333 



<210> 4 

<211> 516 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Pro Ser Ser Thr Ser Pro Asp Gin Gly Asp Asp Leu Glu Asn Cys 
15 10 15 

lie Leu Arg Phe Ser Asp Leu Asp Leu Lys Asp Met Ser Leu lie Asn 
20 25 30 

Pro Ser Ser Ser Leu Lys Ala Glu Leu Asp Gly Ser Thr Lys Lys Lys 
35 40 45 

Tyr Ser Phe Ala Lys Lys Lys Ala Phe Ala Leu Phe Val Lys Thr Lys 
50 55 60 

Glu Val Pro Thr Lys Arg Ser Phe Glu Cys Lys Glu Lys Leu Trp Lys 
65 70 75 80 

Cys Cys Arg Gin Leu Phe Thr Asp Gin Thr Ser lie His Arg His Val 
85 90 95 

Ala Thr Gin His Ala Asp Glu He Tyr His Gin Thr Ala Ser He Leu 
100 105 HO 

Lys Gin Leu Ala Val Thr Leu Ser Thr Ser Lys Ser Leu Ser Ser Ala 
115 120 125 

Asp Glu Lys Asn Pro Leu Lys Glu Cys Leu Pro His Ser His Asp Val 
130 135 140 

Ser Ala Trp Leu Pro Asp He Ser Cys Phe Asn Pro Asp Glu Leu He 
145 " 150 155 160 

Ser Gly Gin Gly Ser Glu Glu Gly Glu Val Leu Leu Tyr Tyr Cys Tyr 
165 170 175 

His Asp Leu Glu Asp Pro Gin Trp He Cys Ala Trp Gin Thr Ala Leu 
180 185 190 

Cys Glri His Leu His Leu Thr Gly Lys He Arg He Ala Ala Glu Gly 
195 200 205 

He Asn Gly Thr Val Gly Gly Ser Lys Leu Ala Thr Arg Leu Tyr Val 
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210 




215 



220 




Glu Val Met Leu Ser Phe Pro Leu Phe Lys Asp Asp Leu Cys Lys Asp 
225 230 235 240 

Asp Phe Lys Thr Ser Lys Gly Gly Ala His Cys Phe Pro Glu Leu Arg 
245 250 255 

Val Gly Val Phe Glu Glu lie Val Pro Met Gly lie Ser Pro Lys Lys 
260 265 270 

He Ser Tyr Lys Lys Pro Gly He His Leu Ser Pro Gly Glu Phe His 
275 " 280 285 

Lys Glu Val Glu Lys Phe Leu Ser Gin Ala Asn Gin Glu Gin Ser Asp 
290 295 300 

Thr He Leu Leu Asp Cys Arg Asn Phe Tyr Glu Ser Lys He Gly Arg 
305 310 315 320 

Phe Gin Gly Cys Leu Ala Pro Asp He Arg Lys Phe Ser Tyr Phe Pro 
325 330 335 

Ser Tyr Val Asp Lys Asn Leu Glu Leu Phe Arg Glu Lys Arg Val Leu 
340 345 350 

Met Tyr Cys Thr Gly Gly He Arg Cys Glu Arg Gly Ser Ala Tyr Leu 
355 " 360 365 

Lys Ala Lys Gly Val Cys Lys Glu Val Phe Gin Leu Lys Gly Gly He 
370 375 380 

His Lys Tyr Leu Glu Glu Phe Pro Asp Gly Phe Tyr Lys Gly Lys Leu 
385 390 395 400 

Phe Val Phe Asp Glu Arg Tyr Ala Leu Ser Tyr Asn Ser Asp Val Val 
405 410 415 

Ser Glu Cys Ser Tyr Cys Gly Ala Arg Trp Asp Gin Tyr Lys Leu Cys 
420 425 430 

Ser Thr Pro Gin Cys Arg Gin Leu Val Leu Thr Cys Pro Ala Cys Gin 
435 440 445 

Gly Gin Gly Phe Thr Ala Cys Cys Val Thr Cys Gin Asp Lys Gly Ser 
450 455 460 

Arg Lys Val Ser Gly Pro Met Gin Asp Ser Phe Lys Glu Glu Cys Glu 
465 470 475 480 

Cys Thr Ala Arg Arg Pro Arg He Pro Arg Glu Leu Leu Gin His Val 
485 490 495 

Arg Gin Pro Val Ser Pro Glu Pro Gly Pro Asp Ala Asp Glu Asp Gly 
500 505 510 

Pro Val Leu Met 
515 



<210> 5 
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<211> 526 
<212> PRT 
<213> Mus musculus 

<400> 5 

Met Pro Ser Ser Thr Ser Pro Asp Glu Glu Asp Gly Leu Glu Thr Cys 
15 10 15 

Val Leu Lys Val Phe Asp Leu Asp Leu Lys Glu Ser Asn Leu Val Asn 
20 25 30 

Pro Ser Asn Ser Leu Lys Ala Glu Leu Asp Gly Ser Thr Lys Lys Lys 
35 40 45 

Tyr Ser Phe Ala Lys Lys Lys Ala Phe Ala Leu Leu Val Lys Thr Lys 
50 55 60 

Gin Val Pro Ala Pro Ser Tyr Glu Phe Lys Gly Lys Arg Trp Arg Cys 
65 70 75 80 

Cys Gin Gin Leu Phe Ala Asp Gin lie Ser lie His Arg His Val Ala 
85 90 95 

Thr Gin His Ala Glu Asp Val Tyr Gin Gin Thr Ala Ser Leu Leu Lys 
100 105 110 

Gin Leu Thr Ala Ala Leu Ser Ala Ser Gin Ser Leu Thr Pro Thr Asp 
115 120 125 

Lys Arg Ser Ser Pro Lys Asp Cys Leu Thr Pro Ser Gin Glu Val Ser 
130 135 140 

Ala Trp Leu Pro Asp Val Ser His Val Ser Pro Gin Glu Leu Arg Ser 
145 150 155 160 

Gly Gin Val Thr Glu Glu Arg Glu Val Leu Leu Tyr Tyr Cys Tyr Cys 
165 170 175 

Asp Leu Glu Asp Pro His Trp Val Cys Ala Trp Gin Thr Ala Leu Cys 
180 185 190 

His His Leu His Leu Thr Gly Lys lie Arg lie Ala Thr Glu Gly He 
195 200 205 

Asn Gly Thr Val Gly Gly Ser Lys Val Ala Thr Arg Leu Tyr Val Glu 
210 215 220 

Val Met Leu Ser Cys Pro Leu Phe Lys Asp Tyr Leu Ser Glu Asp Asp 
225 230 235 240 

Phe Lys Ser Ser Lys Gly Gly Ser His Cys Phe Pro Glu Leu Arg Val 
245 250 255 

Gly Val Phe Glu Glu He Val Pro Met Gly He Ser Pro Ser Gin Val 
260 265 270 

Ser Tyr Lys Lys Pro Gly He His Leu Ser Pro Gly Glu Phe His Lys 
275 280 285 

Glu He Glu Lys Leu Leu Ser Gin Ser Ser Glu Glu Gin Gly Asn Thr 
290 295 300 
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lie lie Leu Asp Cys Arg Asn Phe Tyr Glu Ser Lys lie Gly Arg Phe 
305 ' 310 315 320 

Gin Gly Cys Leu Ala Pro Asp lie Arg Lys Phe Ser Tyr Phe Pro Ser 
325 330 335 

Tyr Val Asp Lys Asn Leu Asp lie Phe Arg Gin Lys Arg Val Leu Met 
340 345 350 

Tyr Cys Thr Gly Gly He Arg Cys Glu Arg Gly Ser Ala Tyr Leu Arg 
355 360 365 

Ala Lys Gly Val Cys Lys Glu Val Phe Gin Leu Lys Gly Gly He His 
370 ~ " 375 380 

Lys Tyr Leu Glu Glu Phe Pro Asp Gly Phe Tyr Lys Gly Lys Leu Phe 
385 390 395 400 

Val Phe Asp Glu Arg Phe Ala Leu Ala Tyr Asn Ser Ser Val Val Ser 
405 410 415 

Glu Cys Ser Tyr Cys Gly Ala Pro Trp Asp Gin Tyr Lys Leu Cys Ser 
420 425 430 

Thr Pro Gin Cys Arg Gin Leu Val Leu Thr Cys Ser Ala Cys Gin Gly 
435 440 445 

Gin Gly Phe Thr Ala Cys Cys Val Thr Cys Gin Asp Lys Gly Gly Lys 
450 455 460 

Gin Ala Ser Gly Pro Ser Gin Asp Ser Phe Lys Glu Glu Cys Glu Cys 
465 ^ 470 475 480 

Thr Ala Arg Arg His Glu Ser His Arg Asn Ser Arg His Ser His Glu 
485 490 495 

Phe Ser Pro Cys Glu Pro Gly Pro Gly Pro Gly Val Pro His Ser Leu 
500 505 510 

Thr His Ala Asp Leu Ser Cys His Val Gin Leu Glu Thr Val 
515 520 525 



<210> 6 

<211> 4333 

<212> DNA 

<213> Homo sapiens 

<400> 6 

atgtccttcc ggggccacgc agaaagtgcc gccgctttgg ccactcagag cccccgggcc 

gcggtcgtcg tacgcctgaa ggcgggtcgt gccggcggcc gctctagtct ccgcctccgc 120 

tcaggccggt cctccggggc ttctcaatgg tttcccggtg gcctctcaat ggttttcccg 180 

gcggcccttg cgccgacgcc aggagacttc cggagcttgg tgacgtcacg agcgagcttt 240 

tctacccaaa tacgcggcgg gggaataggc tcgagggcgg tgagcagtga caattgctag 300 

gcggagacag tgcagggaag agagacctta gaaaggatca ggactggcgg gaggtattta 360 

actgaaagga atatctgctt cactgttgca accaaaccag atgccttctt ccacttcacc 420 

agaccaagga gatgacctgg agaactgcat tttaagattt tctgacctgg atttaaaaga 480 

tatgagtctt attaatccca gcagcagtct taaagcagaa ttagatggca gtacaaaaaa 540 

gaaatactcg tttgcaaaga aaaaggcctt tgcccttttt gtcaaaacca aagaagttcc 600 



60 



6 



WO 2004/003159 PC77US2003/020410 

aacaaaaagg agttt^JEat gtaaagaaaa attgtggaaa tgctgtcgg^agctattcac 660 

agaccaaacc agcatccata gacatgtggc aacacaacat gctgatgaaa tttatcacca 720 

gacagcttct attttaaagc aactggctgt gacattgagc acctcaaaga gtctttcgtc 780 

tgcagatgaa aagaaccctt taaaagagtg ccttccacat agccatgacg tgtctgcttg 840 

gctccctgat ataagctgct ttaaccctga tgagctgata agtggccagg gcagtgaaga 900 

aggggaggtg ctcctttatt actgctacca tgacctggag gatccccaat ggatctgtgc 960 

ctggcagaca gctctgtgtc agcacctgca cctcacaggc aagattcgaa ttgctgcaga 1020 

aggaatcaat gggacagttg gtggaagcaa attggctacc agactttatg tggaagtcat 1080 

gctttccttc ccattgttta aggatgacct gtgtaaagat gattttaaga ccagcaaagg 114 0 

aggagctcac tgttttccag aattgcgtgt tggtgtattt gaagaaatcg tgcccatggg 1200 

gatcagcccc aaaaagatct cctacaagaa gcctggaatc catttatccc caggtgaatt 12 60 

tcataaagaa gtagaaaagt ttttatctca ggcaaatcaa gaacaaagtg atactatcct 1320 

tcttgattgc agaaacttct atgaaagcaa aataggacga ttccaaggct gcttagcccc 1380 

agacatcagg aaattcagtt acttccctag ctacgttgac aaaaatctag aacttttcag 14 4 0 

agagaagaga gtgctgatgt actgtaccgg gggcatccgc tgtgagcggg gttcagccta 1500 

cctcaaagcc aagggagtgt gcaaggaggt gttccagctc aagggtggca tccacaagta 1560 

cctggaagag tttcctgatg gcttttacaa agggaagttg tttgtttttg atgaacgcta 1620 

tgctctgtcc tacaacagtg atgtggtgtc agagtgttca tactgtggag cccgctggga 1680 

ccagtataaa ctctgctcta ctccccagtg ccgccagctc gttttgacct gccctgcctg 1740 

tcaaggacaa ggattcacag cctgttgtgt cacatgtcaa gacaagggga gcaggaaagt 1800 

tgcaggccct atgcaagaca gctttaaaga ggaatgcgag tgcacagccc gacggccacg 1860 

catacctagg gaactcttgc agcatgtgcg acagcctgtg agcccagagc cagggcctga 1920 

tgctgatgag gatgggccag tgcttatgtg agcagcacct ttggcatttt cccaggccct 1980 

cggtaaaagt aggtttgggg tgactataca gagaaagcat ggcaagactg cagaaacaga 2040 

gaaatcggga acttcagttc tggccgctgc caccgtggca gccgtctaca cttcacagcg 2100 

ggaggggagg agtcacgttg tctaccactt acctgagaca ttctgatttg gatgatgcta 2160 

gagcacagaa aataggtgag ctgcatggga tcccaaagct gctgagggat agagcctgag 2220 

cctggtggcc acagcatatg ccctttctgt tccatgcagc tggggctgtt agtagtcatt 2280 

gcccttgtca gcagaccttc taccctggtg gcaaacacat gaaagctgtg gccctgggag 2340 

tggcctccta aaacaagcca cttaggtcat ctgccatcta cccttaacct ctgtctctcg 24 00 

cctgagggga atctgcaagc tgtgcattgg gcttacctcc tgcttttgta gaaataacca 2460 

tcctttggta tacatggagg atagttccag aacgcctgag tatacaaaaa cccaatgcat 2520 

actcaagtcc cacagtgggc cctacagaac ccacgtatgt gataaatcag ccctccatgt 2580 

acgcaggttt cgccccctgc caatactgta ttttcaacct gtatggttga aaaaaatcca 2 64 0 

tatataagtg cagccatgca gttcaaaccc atattgttca agggtcaact gtatagttta 2700 

ttgaacagcc acacccattc ctttacacat gatctatggc agagttgaat agttgcaaca 2760 

gacactatgt ggcctgcaaa atcggaaatt tttactgtct ggccttttac agaaaagttt 2820 

gccagcccct gatctagacc agcagctcat ctgatagagg cagaggtggc cttaaagatg 2880 

tggccttctt cattttctgt tggtttggtt tcgtttctat gagagatttc ctctgatagc 2940 

tctgctttcc ccagcactta ctctctgagc ttttaaatgt tctctctggg agcttcatat 3000 

aagctcggtg acatttgagc cacagttttt agatcagcac ctggaataca tgacacattc 3060 

ttactgaggt catccagcac tgccatggtg gctgcccagt cttctggcca gtgtgccagg 3120 

cacatgtccc tgtcacacag gttccaagaa acacatacgc agccatgcat agaccaacag 3180 

atttaatatt atattgcagt tttcagcgat gcagaatgca gctgcaattg tgttttaagg 3240 

agaagccaaa tggggatggt tgtccctgca acatggtgcc actcctgggc catgtgcagc 3300 

ctcagtggac actcttccat agcgctgagg ccctggcccc gcctccagtt accctgtact 3360 

gcccactgcc ttacagttca gtgcgcaggc cttcaccttt tcatcaccag cctctctgct 3420 

cagtgctctg gagttcttga ccttgtcctt tatcatgaga tttgctgaaa tcactaatga 3480 

aaataactcc caaaagcaac aaacaaaaat attagtttaa ctggcactgt ggtatattaa 354 0 

aaggcacaag ggcattgtgg cttaacactt ttgctggatc ccaagagacg cacatgatgt 3600 

taaaaagaga tctggcagca gtactaatac tacatttcag tgtaatcatc ttggggtggt 3660 

ttggccagga tttcccaatt ccttgatatc tggagtttct tcaccattgt ccggcatcct . 3720 

gcggaggctt aatatacagg cgtaaggtca gcagcaattt gtctaataag tgatgagatc 3780 

agtagctgaa gtctctaagc tgggccatta ctaaatacca tagccatgtt gatctggaaa 3840 

tttatccctc tagtgtctta cctcacataa gccatttgcc cactgtgcaa tatagaaagg 3900 

tgttttcaaa agtatttggc cgtagatttt cacatccatc ataaggttgg cattcaataa 3960 

ggaaaaagtt ctaactccag tattaaattg tacataaatc ccaaatgttc ttaaagaaca 4020 

ctcagggaca tgtttgttgc ctgggattgg taatgaaagg ttggtttttg aaacttgaaa 4 080 

tttcaccatt ggtttttttc ctatcatttc tgcatatcca gcaaaaggaa tctcatgttg 4140 

actcctggca gagttcagtg gcttcagtct gtctatctgt tctgagggga aaattgtgtt 4 200 

ctggatccag taatcaattt ggcaacttta atcgaggttt tcaaaattcc aaggagggtt 4260 
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aataaagaat gatS^agt tttatttgct aatagctaag acaaatt^^aataaagtgt 4 320 

tttataatac ttc - 4333 



<210> 7 

<211> 529 

<212> PRT 

<213> Homo sapiens 

<400> 7 

Leu Lys Gly lie Ser Ala Ser Leu Leu Gin Pro Asn Gin Met Pro Ser 
15 10 15 

Ser Thr Ser Pro Asp Gin Gly Asp Asp Leu Glu Asn Cys lie Leu Arg 
20 25 30 

Phe Ser Asp Leu Asp Leu Lys Asp Met Ser Leu lie Asn Pro Ser Ser 
35 40 45 

Ser Leu Lys Ala Glu Leu Asp Gly Ser Thr Lys Lys Lys Tyr Ser Phe 
50 55 60 

Ala Lys Lys Lys Ala Phe Ala Leu Phe Val Lys Thr Lys Glu Val Pro 
65 * 70 75 80 

Thr Lys Arg Ser Phe Glu Cys Lys Glu Lys Leu Trp Lys Cys Cys Arg 
85 90 95 

Gin Leu Phe Thr Asp Gin Thr Ser lie His Arg His Val Ala Thr Gin 
100 105 110 

His Ala Asp Glu lie Tyr His Gin Thr Ala Ser lie Leu Lys Gin Leu 
115 120 125 

Ala Val Thr Leu Ser Thr Ser Lys Ser Leu Ser Ser Ala Asp Glu Lys 
130 135 140 

Asn Pro Leu Lys Glu Cys Leu Pro His Ser His Asp Val Ser Ala Trp 
145 150 155 160 

Leu Pro Asp lie Ser Cys Phe Asn Pro Asp Glu Leu lie Ser Gly Gin 
165 170 175 

Gly Ser Glu Glu Gly Glu Val Leu Leu Tyr Tyr Cys Tyr His Asp Leu 
180 185 190 

Glu Asp Pro Gin Trp lie Cys Ala Trp Gin Thr Ala Leu Cys Gin His 
195 200 205 

Leu His Leu Thr Gly Lys lie Arg lie Ala Ala Glu Gly lie Asn Gly 
210 215 220 

Thr Val Gly Gly Ser Lys Leu Ala Thr Arg Leu Tyr Val Glu Val Met 
225 230 235 240 

Leu Ser Phe Pro Leu Phe Lys Asp Asp Leu Cys Lys Asp Asp Phe Lys 
245 250 255 

Thr Ser Lys Gly Gly Ala His Cys Phe Pro Glu Leu Arg Val Gly Val 
260 265 270 
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Phe Glu Glu II 
275 




Pro Met Gly He 
280 



Ser Pro Lys Lys Ire Ser Tyr 
285 



Lys Lys Pro Gly 
290 



He His Leu Ser Pro 
295 



Gly Glu Phe His Lys Glu Val 
300 



Glu Lys Phe Leu 
305 



Ser Gin Ala Asn Gin 
310 



Glu Gin Ser Asp Thr lie Leu 
315 320 



Leu Asp Cys Arg 



Asn 
325 



Phe Tyr Glu Ser 



Lys He Gly Arg Phe Gin Gly 
330 335 



Cys Leu Ala Pro Asp 
340 



He Arg Lys Phe 
345 



Ser Tyr Phe Pro Ser Tyr Val 
350 



Asp Lys Asn Leu 
355 



Glu Leu Phe Arg Glu 
360 



Lys Arg Val Leu Met Tyr Cys 
365 



Thr Gly Gly He Arg 
370 

Gly Val Cys Lys Glu 
385 



Cys Glu Arg Gly 
375 

Val Phe Gin Leu 

390 



Ser Ala Tyr Leu Lys Ala Lys 
380 

Lys Gly Gly He His Lys Tyr 
395 400 



Leu Glu Glu Phe 



Pro 
405 



Asp Gly Phe Tyr 



Lys Gly Lys Leu Phe Val Phe 
410 415 



Asp Glu Arg Tyr 
420 



Ala Leu Ser Tyr Asn 
425 



Ser Asp Val Val Ser Glu Cys 
430 



Ser Tyr Cys Gly 
435 



Ala Arg Trp Asp Gin 
440 



Tyr Lys Leu Cys Ser Thr Pro 
445 



Gin Cys Arg Gin 
450 



Leu Val Leu Thr Cys 
455 



Pro Ala Cys Gin Gly Gin Gly 
4 60 



Phe Thr Ala Cys Cys 
465 



Val Thr Cys Gin 
470 



Asp Lys Gly Ser Arg Lys Val 
475 480 



Ala Gly Pro Met 



Gin 
485 



Asp Ser Phe Lys 



Glu Glu Cys Glu Cys Thr Ala 
490 495 



Arg Arg Pro Arg 
500 



He Pro Arg Glu Leu 
505 



Leu Gin His Val Arg Gin Pro 
510 



Val Ser Pro Glu 
515 



Pro Gly Pro Asp Ala 
520 



Asp Glu Asp Gly Pro Val Leu 
525 



Met 



<210> 8 

<211> 4334 

<212> DNA 

<213> Homo sapiens 

<400> 8 

atgtccttcc ggggccacgc agaaagtgcc gccgctttgg ccactcagag cccccgggcc 60 

gcggtcgtcg tacgcctgaa ggcgggtcgt gccggcggcc gctctagtct ccgcctccgc 120 

tcaggccggt cctccggggc ttctcaatgg tttcccggtg gcctctcaat ggttttcccg 180 

gcggcccttg cgccgacgcc aggagacttc cggagcttgg tgacgtcacg agcgagcttt 24 0 
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A to. 

tctacccaaa ta^pEjgcgg gggaataggc tcgagggcgg tgagca^|Mt' caattgctag 300 

gcggagacag tgcagggaag agagacctta gaaaggatca ggactgg&jg gaggtattta 360 

actgaaagga atatctgctt cactgttgca accaaaccag atgccttctt ccacttcacc 4 20 

agaccaagga gatgacctgg agaactgcat tttaagattt tctgacctgg atttaaaaga 4 80 

tatgagtctt attaatccca gcagcagtct taaagcagaa ttagatggca gtacaaaaaa 540 

gaaatactcg tttgcaaaga aaaaggcctt tgcccttttt gtcaaaacca aagaagttcc 600 

aacaaaaagg agttttgaat . gtaaagaaaa attgtggaaa tgctgtcggc agctattcac 660 

agaccaaacc agcatccata gacatgtggc aacacaacat gctgatgaaa tttatcacca 720 

gacagcttct attttaaagc aactggctgt gacattgagc acctcaaaga gtctttcgtc 780 

tgcagatgaa aagaaccctt taaaagagtg ccttccacat agccatgacg tgtctgcttg 840 

gctccctgat ataagctgct ttaaccctga tgagctgata agtggccagg gcagtgaaga 900 

aggggaggtg ctcctttatt actgctacca tgacctggag gatccccaat ggatctgtgc 960 

ctggcagaca gctctgtgtc agcacctgca cctcacaggc aagattcgaa ttgctgcaga 1020 

aggaatcaat gggacagttg gtggaagcaa attggctacc agactttatg tggaagtcat 1080 

gctttccttc ccattgttta aggatgacct gtgtaaagat gattttaaga ccagcaaagg 1140 

aggagctcac tgttttccag aattgcgtgt tggtgtattt gaagaaatcg tgcccatggg 1200 

gatcagcccc aaaaagatct cctacaagaa gcctggaatc catttatccc caggtgaatt 1260 

tcataaagaa gtagaaaagt ttttatctca ggcaaatcaa gaacaaagtg atactatcct 1320 

tcttgattgc agaaacttct atgaaagcaa aataggacga ttccaaggct gcttagcccc 1380 

agacatcagg aaattcagtt acttccctag ctacgttgac aaaaatctag aacttttcag 1440 

agagaagaga gtgctgatgt actgtaccgg gggcatccgc tgtgagcggg gttcagccta 1500 

cctcaaagcc aagggagtgt gcaaggaggt gttccagctc aagggtggca tccacaagta 1560 

cctggaagag tttcctgatg gcttttacaa agggaagttg tttgtttttg atgaacgcta 1620 

tgctctgtcc tacaacagtg atgtggtcrtc agagtgttca tactgtggag cccgctggga 1680 

ccagtataaa ctctgctcta ctcccccagt gccgccagct cgttttgacc tgccctgcct 174 0 

gtcaaggaca aggattcaca gcctgttgtg tcacatgtca agacaagggg agcaggaaag 1800 

tttcaggccc tatgcaagac agctttaaag aggaatgcga gtgcacagcc cgacggccac 1860 

gcatacctag ggaactcttg cagcatgtgc gacagcctgt gagcccagag ccagggcctg 1920 

atgctgatga ggatgggcca gtgcttatgt gagcagcacc tttggcattt tcccaggccc 1980 

tcggtaaaag taggtttggg gtgactatac agagaaagca tggcaagact gcagaaacag 2040 

agaaatcggg aacttcagtt ctggccgctg ccaccgtggc agccgtctac acttcacagc 2100 

gggaggggag gagtcacgtt gtctaccact tacctgagac attctgattt ggatgatgct 2160 

agagcacaga aaataggtga gctgcatggg atcccaaagc tgctgaggga tagagcctga 2220 

gcctggtggc cacagcatat gccctttctg ttccatgcag ctggggctgt tagtagtcat 2280 

tgcccttgtc agcagacctt ctaccctggt ggcaaacaca tgaaagctgt ggccctggga 234 0 

gtggcctcct aaaacaagcc acttaggtca tctgccatct acccttaacc tctgtctctc 2400 

gcctgagggg aatctgcaag ctgtgcattg ggcttacctc ctgcttttgt agaaataacc 24 60 

atcctttggt atacatggag gatagttcca gaacgcctga gtatacaaaa acccaatgca 2520 

tactcaagtc ccacagtggg ccctacagaa cccacgtatg tgataaatca gccctccatg 2580 

tacgcaggtt tcgccccctg ccaatactgt attttcaacc tgtatggttg aaaaaaatcc 2640 

atatataagt gcagccatgc agttcaaacc catattgttc aagggtcaac tgtatagttt 2700 

attgaacagc cacacccatt cctttacaca tgatctatgg cagagttgaa tagttgcaac 2760 

agacactatg tggcctgcaa aatcggaaat ttttactgtc tggcctttta cagaaaagtt 2820 

tgccagcccc tgatctagac cagcagctca tctgatagag gcagaggtgg ccttaaagat 2880 

gtggccttct tcattttctg ttggtttggt ttcgtttcta tgagagattt cctctgatag 2940 

ctctgctttc cccagcactt actctctgag cttttaaatg ttctctctgg gagcttcata 3000 

taagctcggt gacatttgag ccacagtttt tagatcagca cctggaatac atgacacatt 3060 

cttactgagg tcatccagca ctgccatggt ggctgcccag tcttctggcc agtgtgccag 3120 

gcacatgtcc ctgtcacaca ggttccaaga aacacatacg cagccatgca tagaccaaca 3180 

gatttaatat tatattgcag ttttcagcga tgcagaatgc agctgcaatt gtgttttaag 3240 

gagaagccaa atggggatgg ttgtccctgc aacatggtgc cactcctggg ccatgtgcag 3300 

cctcagtgga cactcttcca tagcgctgag gccctggccc cgcctccagt taccctgtac 3360 

tgcccactgc cttacagttc agtgcgcagg ccttcacctt ttcat caeca gcctctctgc 3420 

teagtgetet ggagttcttg accttgtcct ttatcatgag atttgetgaa atcactaatg 3480 

aaaataactc ccaaaagcaa caaacaaaaa tattagttta actggcactg tggtatatta 3540 

aaaggcacaa gggcattgtg gcttaacact tttgctggat cccaagagac gcacatgatg 3600 

ttaaaaagag atctggcagc agtactaata ctacatttca gtgtaatcat cttggggtgg 3660 

tttggccagg atttcccaat tccttgatat ctggagtttc ttcaccattg tccggcatcc 3720 

tgeggagget taatatacag gcgtaaggtc agcagcaatt tgtctaataa gtgatgagat 3780 

cagtagctga agtctctaag ctgggccatt actaaatacc atagecatgt tgatctggaa 3840 

atttatccct ctagtgtctt acctcacata agecatttge ccactgtgca atatagaaag 3900 
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gtgttttcaa 
aggaaaaagt 
actcagggac 
atttcaccat 
gactcctggc 
tctggatcca 
taataaagaa 
ttttataata 



4 

aag^^tgg 

tctaatetcca 
atgtttgttg 
tggttttttt 
agagttcagt 
gtaatcaatt 
tgataatcag 
cttc 



PCT/US2003/020410 



ccgtagattt 
gtattaaatt 
cctgggattg 
cctatcattt 
ggcttcagtc 
tggcaacttt 
ttttatttgc 



tcacatccat 
gtacataaat 
gtaatgaaag 
ctgcatatcc 
tgtctatctg 
aatcgaggtt 
taatagctaa 



jg^^ c 



cataaggt^^ gcattcaata 
cccaaatgtt cttaaagaac 
gttggttttt gaaacttgaa 
agcaaaagga atctcatgtt 
ttctgagggg aaaattgtgt 
ttcaaaattc - caaggagggt 
gacaaatttg taataaagtg 



3960 
4020 
4080 
4140 
4200 
4260 
4320 
4334 



<210> 9 

<211> 488 

<212> PRT 

<213> Homo sapiens 

<400> 9 

Leu Lys Gly lie Ser Ala Ser Leu Leu Gin Pro Asn Gin Met Pro Ser 
1 5 10 15 

Ser Thr Ser Pro Asp Gin Gly Asp Asp Leu Glu Asn Cys lie Leu Arg 
20 25 30 

Phe Ser Asp Leu Asp Leu Lys Asp Met Ser Leu lie Asn Pro Ser Ser 
35 40 45 

Ser Leu Lys Ala Glu Leu Asp Gly Ser Thr Lys Lys Lys Tyr Ser Phe 
50 55 60 

Ala Lys Lys Lys Ala Phe Ala Leu Phe Val Lys Thr Lys Glu Val Pro 
65 70 75 80 

Thr Lys Arg Ser Phe Glu Cys Lys Glu Lys Leu Trp Lys Cys Cys Arg 
85 90 " 95 

Gin Leu Phe Thr Asp Gin Thr Ser He His Arg His Val Ala Thr Gin 
100 105 HO 

His Ala Asp Glu He Tyr His Gin Thr Ala Ser He Leu Lys Gin Leu 
115 120 125 

Ala Val Thr Leu Ser Thr Ser Lys Ser Leu Ser Ser Ala Asp Glu Lys 
130 135 140 

Asn Pro Leu Lys Glu Cys Leu Pro His Ser His Asp Val Ser Ala Trp 
145 150 155 160 

Leu Pro Asp He Ser Cys Phe Asn Pro Asp Glu Leu He Ser Gly Gin 
165 170 175 

Gly Ser Glu Glu Gly Glu Val Leu Leu Tyr Tyr Cys Tyr His Asp Leu 
180 185 " 190 

Glu Asp Pro Gin Trp He Cys Ala Trp Gin Thr Ala Leu Cys Gin His 
195 200 205 

Leu His Leu Thr Gly Lys He Arg He Ala Ala Glu Gly He Asn Gly 
210 215 220 

Thr Val Gly Gly Ser Lys Leu Ala Thr Arg Leu Tyr Val Glu Val Met 
225 230 235 240 
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Leu Ser Phe eu Phe Lys Asp Asp Leu Cys Lys AslJ&p Phe Lys 

245 250 ^ 255 

Thr Ser Lys Gly Gly Ala His Cys Phe Pro Glu Leu Arg Val Gly Val 
2 60 265 270 

Phe Glu Glu He Val Pro Met Gly He Ser Pro Lys Lys He Ser Tyr 
275 280 285 

Lys Lys Pro Gly He His Leu Ser Pro Gly Glu Phe His Lys Glu Val 
290 295 300 

Glu Lys Phe Leu Ser Gin Ala Asn Gin Glu Gin Ser Asp Thr He Leu 
305 310 315 320 

Leu Asp Cys Arg Asn Phe Tyr Glu Ser Lys He Gly Arg Phe Gin Gly 
325 330 335 

Cys Leu Ala Pro Asp He Arg Lys Phe Ser Tyr Phe Pro Ser Tyr Val 
340 345 350 

Asp Lys Asn Leu Glu Leu Phe Arg Glu Lys Arg Val Leu Met Tyr Cys 
355 360 365 

Thr Gly Gly He Arg Cys Glu Arg Gly Ser Ala Tyr Leu Lys Ala Lys 
370 375 380 

Gly Val Cys Lys Glu Val Phe Gin Leu Lys Gly Gly He His Lys Tyr 
385 390 395 400 

Leu Glu Glu Phe Pro Asp Gly Phe Tyr Lys Gly Lys Leu Phe Val Phe 
405 410 415 

Asp Glu Arg Tyr Ala Leu Ser Tyr Asn Ser Asp Val Val Ser Glu Cys 
420 425 430 

Ser Tyr Cys Gly Ala Arg Trp Asp Gin Tyr Lys Leu Cys Ser Thr Pro 
435 440 445 

Pro Val Pro Pro Ala Arg Phe Asp Leu Pro Cys Leu Ser Arg Thr Arg 
450 455 460 

He His Ser Leu Leu Cys His Met Ser Arg Gin Gly Glu Gin Glu Ser 
465 470 475 480 

Phe Arg Pro Tyr Ala Arg Gin Leu 
485 



<210> 10 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 
<223> 



expression primer 



<400> 10 

ctgtgtcagc acctgcacct c 



21 



12 
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<210> 




/Oils 




<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


expression primer 


<400> 


11 


atccccatgg gcacgatttc 


<210> 


12 


<211> 


22 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


expression primer 


<400> 


12 


tttccagaat tgcgtgttgg tg 
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20 



22 



<210> 13 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<223> expression primer 
<400> 13 

tggatgccac ccttgagctg 



20 



<210> 14 

<211> 4888 

<212> DNA 

<213> Mus musculus 



<400> 14 

aacccaatta 

tgtgtatcac 

gagcacaata 

cacgcttagg 

gtcgagcttc 

cctaacacta 

tttatggcaa 

atctgctcgg 

aaacaccccg 

taaaagaggg 

gacctctgtc 

ggagggctaa 

gaggcagcag 

tcagtgtcgc 

atgactgaag 

ccgagccagc 

ccgttcccgg 

gccgagagca 

gtgcgcagcc 



aacttggggg 
ctctgagaca 
tctcatttag 
tttaagaagt 
gaacctaaac 
acagctccaa 
cacaacgacg 
gttttcctgc 
tgcctgttta 
gaatcagcgg 
atagctttgt 
aaggaaaaca 
cagacagaca 
gggaccccag 
gagatggggt 
agcctccctt 
gagactgaag 
cttaccatcg 
gatctgagtt 



agggagggct 
ttagcagttg 
ccccccgtag 
catatgaccc 
ccttcggttc 
gtgtgttctt 
tccttgccgg 
actttcttat 
ttagtgacct 
gtggcaggtg 
tcttttcctg 
ctggtccact 
cacgtgcggt 
gtcggatttg 
agcagggagg 
tggggctccc 
gcgagcccgg 
ttctcgtagc 
gctagccgcc 



ttttttctca 
acaaaataaa 
aaaccaaatg 
tgacaagatc 
taaagttttt 
gaaaggcaat 
cgaccatcaa 
aagggaaata 
ctggggtgtt 
gaggagaagg 
tcttcccagg 
agatatatct 
ttccatgatc 
ctcacgggat 
ccgtcgcttt 
cgagccctcc 
cctcggttct 
tgtgccgcct 
tgcaacccca 



acgatgtcag 
catcatcatg 
gcttagtatt 
acatagtaaa 
tcttgcctta 
aatggtgcaa 
aactgacatg 
tgcttaaagc 
ggtattcttt 
gctcagattg 
gtcctttcca 
cctccactaa 
agtagtgagg 
tcaccaggcc 
cccgaagcca 
cggggagaaa 
cacccggacg 
ccgcgacatg 
gaaccgtcgg 



catttatctg 
aactactggt 
atccacagtc 
taatcgagat 
accctttgga 
caaaacaaaa 
agaattaata 
caatccaaca 
ttttgaagcc 
caatacatcc 
agtacaccct 
gtgatcctaa 
gattaagtcc 
ttcctttccc 
ctatccacaa 
ggattcctat 
cagtggccag 
ctgcttgctg 
gctgccggcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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ggccggtgcg cg^^fagcgg cgaggcgcgc ggactgagcg gcgcgc^Mb gcactcgctt 1200 

acgacgcctg ccggaagcgc gtgcagaggg gcaccggatg ggctccgffag tggaaggccg 1260 

gcgtctgcaa gtccaattgc ggccgcctcc tccggctttc caccccgggc ccttctggac 1320 

agcgagctaa attatccggg ccgcagcgat cccgcctcgg ctttatacag gcccggcggg 1380 

attttcccgg cggcccctgc gccgacgact taggagactt ccggagaaag gtgacggctc 1440 

agggactctc tgcgcacgcg cgcgccgggg ggcgtgggct cgagggcggg gagcagcgcc 1500 

aagttggtga agggagaccc aggaaaggcc tagggtttgc gggtaggtgt tctctctttc 1560 

tctctcgcca tcccgctgta gacgcctcgg gagacctgag cggggaggag ggcaagccac 1620 

tttcagcggt agggaatcca ggcccgtccg gctgaatctg ctcactgtag tcctactgcc 1680 

tacttctccc agctgacgga cgccccagct actgatcggt gtggttccaa ttgttctatt 1740 

cctctgcacg tgtacagtat gtggacgtcc tctgccctgt gatggacacc aaactgtctt 1800 

gtccatggtc tagcccaaac ttttccttca gtcttacctg ttacagtaaa tggccgtttc 1860 

ggtagtttgc ttaaatccaa gaatttaggg aggagtcatt agctccttca cttcctgtgt 1920 

tacagtttat ccactaacac tgcctgctat actctgagag caagtctagt caaacttatt 1980 

tactatgctt aaaactcttc acagacttaa agcaatattt tatgtatttg taagtctgta 2040 

catcatcctg gaactggagt ttcagactct ctgtgagcaa ggaggagagt gcgcgggagc 2100 

ggggtggggg tgctgaatcc cgggtcttct gctagagcag taagcactct taaccagtga 2160 

gccatcttgc catctttccg gctctacttc tctttgttta ttttaaagat tatcgttttt 2220 

ctattctttt catttattta ttcgtacatg caagcaatcc atctttccag gaatttccac 2280 

tgcgaattcc aaactatggc ttcttgggtt ctgttcccgt cattgtgtgt aaaccaccat 2340 

gttcttctgc taaagtccag cttgttcaca ttttaggaat caagacttat tttctttatc 2400 

tgttggtctc tgctctgaaa tttacatgac taactgtcag ggacctacag agttatttct 24 60 

ctgactcctc aactcttgct ttatttcatt cagcaagtac tatattgtat taattaacat 2520 

ttgtcaccgt ctgtacatgg gacaacgaga cggtgttcag gatactgaag ccctaagtga 2580 

ccctctcagg aatgatcaca ggtactgtag tctgacggac ttaatagcaa agggcgagtg 2640 

gtacaaatta tcactcatcc tttgctattg aggacagatt atccagagta agactttgga 2700 

agagtcctcc aagaaaggat tttaatatat tctcatctgg ctccagttgc ctgaaatgag 27 60 

ccaattcgag ccagatgaga atatattaaa aatgggttta ttgggaatct gctctcaggt 2820 

gagttcactg accgctacca ggattgagac cagggaagtc actatgggga tgggggtgga 2880 

ggagggaaga acacaggagc aaagaaagaa gggaaggagg cccaacaagg aggccaaaag 2940 

gtctggattg tatagagagg agcctctggg ggaaggttca gggttgggac agggtatacc 3000 

agatagggac tatgggatac tgggagaacc tgaagaccat gtctgctttg atatgtaaaa 3060 

tatgcacctt ggtccaaggt tagaaaccaa accttaacca ttactaaatc agataagaag 3120 

tgagtgtagt tcaaagtgga ttgctgactt tcaaaggatg gcaaagcatt gggcctggta 3180 

tagtggcccc gaattctaga agtagaggca aggggaatct taaatttagg attactagaa 3240 

cctcaaggcc agcccctact atatagcaag acccagactc agaaaaatta atttaaaaac 3300 

taggggttgg cccaggcagc atacaccagc tgataggagg ccccagacac atatatagca 3360 

gaggactgtg tggtctggcc ttagtgaggg aagatgcacc taaccctcaa gaaacttgag 3420 

gccccaggga gtgggggcat tctcttgggc aggggacaga gaaggtgtgg gatgaggaac 3480 

agagggcaga ctggaaggtg gataaagact agactgtaaa aaagattcaa gaataaaatt 3540 

ttcttaattt tgttggaaaa aaaaaatagg ggttggcaag atggctctgt gggcaaaccc 3600 

ctgccactca agcctgacaa tacaacgtgt gtttgatccc tggagcctgt gcagatatgg 3660 

aactggcagg catctcggca caagtagcca cagaagtaaa tcaaatgtaa caaagaagac 3720 

tgcattggtt atagtgtatc cattttggta agatttggcc acagtagcaa tttggaaatt 3780 

ttggctctga gatttgagaa caaaagaaac aaagctacag aaacactgga gttggttgat 3840 

ttgggaagtc catagtgctt ttctgctctc atgtgctgcc ctgtgttttg ttttcccctg 3900 

gcttacttcc tagttcatgt ttctttgaag atgagtgcta ttgcctcaga agggctcttc 3960 

ctgtgtatga ggagcccttc cacttgcctc cttactccac tcattttcct ttgaaatatt 4 020 

taaaaaaaaa aaagaaaaga aaagaaaagt caactcaaat caatgtatcg gcttaccttt 4 080 

cctagtagat tgtaaactta agaggataga ctttttccta ttgcatttac tgctctattc 414 0 

ctggtgtgta gaagagagcc tggtagagca tagttgtcac ataaatacac tgtggagtga 4200 

gcgaggggac gaacggagag cttggggtag gaggctactg gtcatttcat gagtccatta 4260 

gcttcatgtc tgcttcaaga agtaaccaaa gagctgattg ctatattctc ttttgcttac 4320 

gtctcagaag atacttgtct gcaaggaacc tgtcccactg ataaagccag atgccttctt 4380 

ccacttcacc agacgaagag gatggcctgg agacctgtgt tttaaaggtt tttgatctgg 4 440 

atttaaaaga atcaaatctt gttaatccca gcaacagtct caaagcagag ttagatggca 4500 

gcacaaagaa aaaatactcg tttgcaaaga aaaaggcctt tgcccttttg gtcaaaacca 4560 

aacaagttcc agcaccctct tatgaattta agggaaaacg gtggcgatgt tgtcagcagc 4 620 

tgtttgcaga ccagatcagc atccacagac atgtggccac acagcatgct gaagacgtgt 4 680 

accagcagac tgcgtctctt ctgaagcagc tgactgcagc attgagtgcc tcacagagcc 4740 

ttacgcccac agacaaaagg agctccccca aagactgtct cactcctagt caggaggtgt 4800 
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ctgcttggct tcc«^gtg agccatgtta gcccccagga gctgaggalj^ggccagggtg 
acgaggaagg agaggtgctc ctgtatta \ ** ~ " " 4888 



4860 



<210> 15 

<211> 7551 

<212> DNA 

<213> Homo sapiens 



<400> 15 

gagagacaaa cgttagtaga taaaatttac tcaatttaaa atgtctgtta tgggtttttt 60 

tcttacattg acagcatctg ctaacgttta ccgcctctta agcgttggca aatgataaaa 120 

catcattgtt gctgatgtgt attacctcat ttaaccttca caaaaaccag aagatataaa 180 

tatcattact tctgtattac agactaaagt ttaaggagat ttcataacct ggacaagatc 240 

accaagtaaa tggtagggtc tggctttgaa cctaaaccct ctggttccaa agtctcatct 300 

cttaaccact actatacact ctcttcaaaa aacaatacac taaaatgtta acaatagaat 360 

tactctaagt tttacttttt tggtgtgtgg atttgtgttt ttgaccgatt gtaaggctta 420 

tgtgtaatat aagcatcttt tttttgttgt tgttaaaact agaattattc catctttatg 4 80 

aaggcaacac tgatctccac gagcacaatc atccaaacgg agatgggaac ccctgttcta 540 

actctctcag aggcctgaag ttagttttct agcactctct ctcacacaag ggaaatgcgc 600 

ttacggccag ttcaaaatct gcagtaacct gtgtctgttt attagggacc tcttgggcgc 660 

aggtaagaca atattctttc cctgcagctt gaaagaagtg aaatcaacgg ataaccacta 720 

ccggaacaga gttcagactg caactcatga gatctctcag tcctttcctt cagtgaattc 780 

ttccgagccc ttttcaggtg cagctttgaa ggttaaaacg agacactagg accagtgggt 840 

ccctaagagg cggcagcaca gacacaggtg tgcttccctg ctccgtggtt ttacggacga 900 

aattctgaag catccttaga gaccaagggt ctacccatgg ggttcaccag gccttctttc 960 

ccgaggatcg gcccttgact caaagaactg gggcggggga gaagtcgctt tctagaagcc 1020 

gcccgaccac accgccacca tattctcctc tttccctccg cggctccccg cacgttcccg 1080 

gggagaagag cttccagtct cttccgtgga cacccaaaga cgccgagcct cggctcccaa 1140 

ccgggagcgg cctccgtggc cgggccgcag gcactcaccg tcgttctcgt cgctgtgccg 1200 

ccgccgcgac atgctgccct ccggtgcgcc gcggaaccga gaggccaggc ggtaagcgct 12 60 

gcaggaactg tctggccgct ggccgacgca aggacagctg caaggcgcgc ggatgggccg 1320 

gcacgcaggc gcactagctc gccacggccc cggaagcgca ggagaggccg ccgggtgggg 1380 

ctaggcgctg cgacagccgg cgtgaggaag ctcagtgggc tacgaacgtc tggcacacat 14 4 0 

gcaaccgccc cctcgggctg cctccgcctg ccggctactt ctttctcccg ccttccgctc 1500 

tatgtccttc cggggccacg cagaaagtgc cgccgctttg gccactcaga gcccccgggc 1560 

cgcggtcgtc gtacgcctga aggcgggtcg tgccggcggc cgctctagtc tccgcctccg 1620 

ctcaggccgg tcctccgggg cttctcaatg gtttcccggt ggcctctcaa tggttttccc 1680 

ggcggccctt gcgccgacgc caggagactt ccggagcttg gtgacgtcac gagcgagctt 1740 

ttctacccaa atacgcggcg ggggaatagg ctcgagggcg gtgagcagtg acaattgcta 1800 
ggcggagaca gtgcagggaa gagagacctt agaaaggatc aggactggcg ggtatgtgct . 1860 

catctactcc cactttccgg cttttgccgc cttgggaaaa gtgggaggag aggttgggcc 1920 

aagctcggca tgcggggtgg ggcctgggcg ggaggcggtg ccgcacgtgc cgccccttgg 1980 

tatggaaagg ccggccctga cgcgagcgtg cggctccggg cttgccggct ggccgtcatt 204 0 

ttccttaagt tgtttgctta ggaaggaaca aatgattgtt ttagtaatct gttttcaagg 2100 

gttattgggt acctatatgc ctgtgtgtga gcgtcccctc ccctgatgtc tttatgagca 2160 

ccatactggc ttatctgctt tgtccatggc catctgaaat ttcactctcc cggcctaatt 2220 

ctccttcggt cttacccatt tcagtaaatg acgtcatccg tcggccgctt aatccaaaaa 2280 

tttagagtct ttcttagctc ttttccctct ttttacataa caattatctg taaaccctgc 2340 

ctgctctgct ttcaacacaa atccagaatc agatcgtgtt actgctttgt ttaaaagtct 24 00 

cctatgacgg ccgggcgggg tagctaacgc ttgtaatcct aacacttatg ggaggccgac 24 60 

gcgggcggat ccgcttgagc ctaggagttc aagacccacc tgggcaacaa ggcgaacacc 2520 

gtctttacag gaaaaaaaaa aaaattagct gagcctagtg tttggcgcct gtagtcccag 2580 

ttactcgagg ggagggttgg gggccgaggt gggaggatcc cttgagccca ggaggtcgag 2 640 

aatgcagtga gcggtgatcg cacactgcac tccagcctgg gcaacaaagt gagaccgtgt 2700 

caaaaaaaaa aagtctccta cgactttcca ttgcacactc catgccatgg cctgtggggt 2760 

tctacttctc ctcactgtac tccaaccact ctttctgttc ttacaataag tcaagcttgt 2820 
tcttatttta ggaacttata ctatttcctt tacctgaagg tctctggttc taaatctgcg * 2880 

tggctggctc ttggtcactt aggtctcagc tcaaatgtca ggtccttagt gaggccttct 2 940 

ttggttaccc aatcactgtt ttatttcatt caacgaatat tatatttatt agcatttagc 3000 

acctcctagg tgccaggcag agttctgaat gctgaggatg tagaggtgaa caaaggaaaa 3060 
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cccctgctct 
gccatcatag 
gtgctctgtg 
tttttttttt 
tggctcgttg 
tagcggcaat 
tggggtttcg 
ttacaggcat 
atgctcccaa 
agcttgaact 
caaaagcaaa 
cgaggtgggt 
ctgtctctac 
tcccagctac 
agtgagccga 
aaaaagtatt 
ccaaagtttt 
agactgggga 
cttcacacac 
aggcccaggt 
tttcctttat 
ggaaaattta 
ggtaatctgg 
tttaagtcct 
catttagagg 
tatccctagc 
ttaatgtctc 
tcatagctgc 
aataataatt 
gtcggaggag 
tgagtcagga 
aggatatctg 
aagtttttca 
aggtgacact 
aaaaagtaag 
atatagaggg 
cgacaatgtt 
acaatgcctg 
aatgatatat 
gttgaataag 
agttagggat 
aagagaagct 
caagagaatt 
ttttgagccc 
acatcttaat 
tgtgtgtctg 
gtgttccccc 
cagagttgtg 
tagtggtgct 
tcagaacctt 
tttgccagtt 
acagggcaca 
aagggtctag 
ctctgtgtgt 
cctcagcgtg 
ctcagtcata 
ctaggcttag 
tcacagctaa 
atattaataa 
ttctttccag 
gagagattta 



ct^p^gctg 

gtagtgagca 
aggactttgg 
gagacagggt 
taacctctgc 
tacagacatg 
ccatgttggc 
gagcctctgt 
aggaggaaga 
agggtggtag 
gctggccagg 
ggatcacaag 
taaaaaaaaa 
tcgggaggct 
gatcacccca 
ttctgaacaa 
ttttgtctga 
agacgaaagt 
agtgtaaaat 
tgaagatata 
gtttcccttt 
tgatgaaacc 
tttcattgat 
tggaagtaga 
ttggggtaat 
tggtattttc 
tccttaccag 
attcccagga 
cagagtatgt 
atcattttgg 
aagaacatgg 
atggggaatg 
ggccttcaaa 
tgagttcagt 
ggaaacaaag 
agattggaaa 
acctcccaaa 
gaacctagaa 
gtactttctc 
attcttctac 
gtactttttt 
aaagaaggta 
taggtttaat 
catgaaagca 
ttatgaatcc 
tgtatgttta 
acacataaac 
aattggcaaa 
ttttctgctc 
cagtgtccac 
cactagtaat 
acagggacac 
tggttgatgc 
ggtctctgca 
agcatctcag 
cagcattgct 
ggaaatggac 
tttttaaata 
tattaccatt 
tctacagtac 
gaactcagaa 



atatattcta 
acgggagagt 
acattatcca 
ctcgctctgt 
cttttaggtt 
cgccaccaca 
caggctggtc 
gcagggccag 
tagttcagct 
taaagatggt 
catggtggct 
gtctggagat 
aatacaaaaa 
gaggcaggag 
ctgcactcca 
gtgaatgata 
ataacaggaa 
ttggttttga 
tcacacacag 
aatatggagt 
tttcttttct 
ccatgtactc 
attacctcct 
tgagatcacc 
aaggaagatc 
atgatttgtt 
aatataagct 
ctattccatg 
acataaatca 
ggtggtggga 
ctcatttgtg 
gggagcatag 
ggaagcagtg 
cttcatatac 
cagtggaaag 
agatgctgga 
tgtactgccc 
gaccctccat 
tacctttaca 
ttattttact 
cagtgggact 
gtatttcttc 
aaatctgtat 
ccacttagct 
aaggaacaaa 
gtctccatca 
tgctccttgt 
gtttgtcacc 
cctcttgtgg 
ctctggctgc 
atgaaataac 
cttgtctctg 
tggctgtcag 
tgggttggat 
aatgagagct 
tccatcacgc 
cagaaatact 
aaattttaat 
cattgaaccc 
agtactgtaa 
accttaagta 



acgacccatt 
ggtagatgag 
agtgaattga 
tgcccaggct 
caggcaattc 
ccaggctaat 
ttgaactcct 
tctgacttat 
tttgcagtaa 
ggtgcaaagt 
cacccctgta 
tgagaccatc 
ttagctgcgt 
aatcgcttga 
gcctggtgac 
gtgtgagaaa 
gaataaaatt 
acagttgtat 

tggggtatat 

catttattat 
catgtaattt 
atcatcaact 
ctctctcaat 
tgtagggtaa 
cagcaaagga 
tgtttgtttg 
ccaagagagc 
ggcccggcac 
aaataagctg 
aaggcacccc 
gaactgggca 
tcagagatga 
acacagggta 
atagtaagca 
cataaatgct 
ggtgggaccg 
ttataggtaa 
aaatatttgt 
ataaaacagc 
tctgcctgga 
ttatatgaaa 
acccagtctc 
taccttgcct 
gattcaaatg 
ggtcctttat 
gtcctatcca 
tgccttcacc 
agtgtctccc 
agtttgggac 
ctccttcaat 
cagttgttat 
ctctcagctc 
ctggaggcct 
tggcttactc 
tgcactagcg 
tttatttttg 
gctgcagcca 
gttaaattta 
tagtttgtgc 
tgtagatact 
gcttgcccat 
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cagga 

atcagagflgg 
gaagtcattg 
ggagtgcagt 
ttgtgcctta 
ttgtatattt 
ggcctcccaa 
ttttttaata 
ttctagctgc 
gatctgattg 
atcctagcac 
ctggccaaca 
gtggtggcgt 
acctgggagg 
agaacgaaaa 
ataaggagtc 
gtcttttact 
ttgaagctga 
gaatttgtag 
atagtattct 
tcaaatattt 
tcagtaattc 
tattttgagg 
gtatagatag 
aactgagaag 
tttgtttgtt 
aggtacttca 
atagcagagg 
agactgaaaa 
acactcaggt 
tggtccagtg 
gcctagagta 
gcagacaaga 
tatggggagg 
gggacagact 
gattggatag 
ataagttatc 
tgaatgaatt 
tttgagcatg 
aggcttacag 
gacattttca 
ttccacttat 
tacccttata 
cattttatgt 
aagaaagttg 
gcattgttaa 
tggttgaggt 
accttccttt 
tactttgcat 
attgtcctga 
gttgtggggc 
agaggccaat 
ctgttcctct 
acaatacagt 
tatccttttg 
ggagtagtta 
tttttggaaa 
aaatacagca 
cagacactat 
acaatgcaca 
gttcacatgc 



cacaaatagt 
tagggggaag 
gagtatgatt 
ggcacggtct 
gccacccaag 
ttagtagaga 
agtgctgaga 
gttacttctc 
aaatgatagt 
gattgtactt 
tttgggagtc 
tggtgaaacc 
gcagctgtac 
cggaggttgc 
ccgtctcaaa 
aggaatcatc 
cagataagga 
agctagctgg 
tttgggagaa 
aaacagaaaa 
acagaagtag 
ttaattcatg 
caaatatagt 
agtggttcaa 
caggggccgt 
tattacttac 
actgtttcat 
tgccacaaat 
aagatgcaga 
gcagagacct 
tggctagagt 
tagggaaact 
cataagtgaa 
gaaagaagtg 
acatcaggtc 
taattccttt 
tgtttctaga 
ttaatttgga 
gaggaagtgt 
gtttgtgtag 
tgagaaagga 
ttttcaaata 
acaaagctca 
ggtgcaagtg 
gtattgcctg 
caagcagtgt 
accctgtagg 
ctctcattct 
gttactattt 
agaaactaac 
aggaatttgg 
ctggaatctg 
ccacctggtt 
ggtttggtct 
agaacccaac 
acaagcctgc 
atataatctg 
ttcaattaaa 
atattaatat 
atttccacaa 
ttggcaaact 



3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 
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gctgcagcac aa\Agc ccatactttt tctcagtgtg cctctt^^^cttgtcacct 6780 

gtcacctttg cttt^aaaa gaaccaacat ggtatgatgg ttgtagtgtg tccattttga 6840 

caagatttgg ccacagtagc gattttgaca tttcttctgg catattacag tggggctgtc 6900 

ttgggaacta tttcttctaa ggaaggaaat tttcatagct ctgaaattaa ggggcaaaaa 6960 

gaagaaagtg tttataactg aaggtgctgg cacacatctt gattgatttg ggttgtacct 7020 

gttgcttttc tggtattatt taccctaatg actgcatggc tcgctcccta attcaggttt 7080 

ctgctcaaat gctattgctg cagaaaggct tttcctactc tctgtgttcg aagggcttct 7140 

ccatcactgc ctccctaccc tgctgattat tcttcataac ctttacacca tctgacttta 7200 

aaaataaaaa atcaatttat ccacttgttt acttcacccc actggaatgt aaactctaag 7260 

caggcagaaa ctttttcact ttcgtttact gctatattct cagtgcttag aatagtgctt 7320 

gacttcgagt cgttcctaag taaatactat tgatttagta agtaaatgga tggttagctt 7380 

gggctacttt gcccatgctg atttgcgtga ttgattagct tcatattttc ttcaagaaat 7440 

caaagaatag catggctgct tctttcttat ttatattcta ggaggtattt aactgaaagg 7500 

aatatctgct tcactgttgc aaccaaacca gatgccttct tccacttcac c " "* ~ 7551 

<210> 16 

<211> 488 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Leu Lys Gly He Ser Ala Ser Leu Leu Gin Pro Asn Gin Met Pro Ser 
1 5 10 15 

Ser Thr Ser Pro Asp Gin Gly Asp Asp Leu Glu Asn Cys He Leu Arq 
20 25 30 

Phe Ser Asp Leu Asp Leu Lys Asp Met Ser Leu He Asn Pro Ser Ser 
35 40 45 

Ser Leu Lys Ala Glu Leu Asp Gly Ser Thr Lys Lys Lys Tyr Ser Phe 
50 55 60 

Ala Lys Lys Lys Ala Phe Ala Leu Phe Val Lys Thr Lys Glu Val Pro 
65 7 0 75 eo 

Thr Lys Arg Ser Phe Glu Cys Lys Glu Lys Leu Trp Lys Cys Cys Arg 
85 90 95 

Gin Leu Phe Thr Asp Gin Thr Ser He His Arg His Val Ala Thr Gin 
100 105 no 

His Ala Asp Glu He Tyr His Gin Thr Ala Ser He Leu Lys Gin Leu 
115 120 125 

Ala Val Thr Leu Ser Thr Ser Lys Ser Leu Ser Ser Ala Asp Glu Lys 
130 135 140 

Asn Pro Leu Lys Glu Cys Leu Pro His Ser His Asp Val Ser Ala Trp 
145 150 155 ^ i 6 o 

Leu Pro Asp He Ser Cys Phe Asn Pro Asp Glu Leu He Ser Gly Gin 
165 170 175 

Gly Ser Glu Glu Gly Glu Val Leu Leu Tyr Tyr Cys Tyr His Asp Leu 
180 185 190 

Glu Asp Pro Gin Trp He Cys Ala Trp Gin Thr Ala Leu Cys Gin His 
195 200 205 
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Leu His Leu Th^ly Lys lie Arg He Ala Ala Glu G3*» e Asn Gly 

210 215 220 y 

Thr Val Gly Gly Ser Lys Leu Ala Thr Arg Leu Tyr Val Glu Val Met 
225 230 235 240 

Leu Ser Phe Pro Leu Phe Lys Asp Asp Leu Cys Lys Asp Asp Phe Lys 
245 250 255 

Thr Ser Lys Gly Gly Ala His Cys Phe Pro Glu Leu Arg Val Gly Val 
260 265 270 

Phe Glu Glu He Val Pro Met Gly He Ser Pro Lys Lys He Ser Tyr 
275 280 285 

Lys Lys Pro Gly He His Leu Ser Pro Gly Glu Phe His Lys Glu Val 
290 295 " 300 

Glu Lys Phe Leu Ser Gin Ala Asn Gin Glu Gin Ser Asp Thr He Leu 
305 310 315 320 

Leu Asp Cys Arg Asn Phe Tyr Glu Ser Lys He Gly Arg Phe Gin Gly 
325 330 335 

Cys Leu Ala Pro Asp He Arg Lys Phe Ser Tyr Phe Pro Ser Tyr Val 
340 345 350 

Asp Lys Asn Leu Glu Leu Phe Arg Glu Lys Arg Val Leu Met Tyr Cys 
355 360 365 

Thr Gly Gly He Arg Cys Glu Arg Gly Ser Ala Tyr Leu Lys Ala Lys 
370 375 380 

Gly Val Cys Lys Glu Val Phe Gin Leu Lys Gly Gly He His Lys Tyr 
385 390 395 400 

Leu Glu Glu Phe Pro Asp Gly Phe Tyr Lys Gly Lys Leu Phe Val Phe 
405 410 415 

Asp Glu Arg Tyr Ala Leu Ser Tyr Asn Ser Asp Val Val Ser Glu Cys 
420 425 430 

Ser Tyr Cys Gly Ala Arg Trp Asp Gin Tyr Lys Leu Cys Ser Thr Pro 
435 440 445 

Pro Val Pro Pro Ala Arg Phe Asp Leu Pro Cys Leu Ser Arg Thr Arg 
450 455 460 

He His Ser Leu Leu Cys His Met Ser Arg Gin Gly Glu Gin Glu Ser 
465 470 475 480 

Phe Arg Pro Tyr Ala Arg Gin Leu 
485 



<210> 17 

<211> 529 

<212> PRT 

<213> Homo sapiens 

<400> 17 
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Leu Lys Gly I^H^ Ala Ser Leu Leu Gin Pro Asn Gin IK 1 ' Pro Ser 

1 T 10 15 

Ser Thr Ser Pro Asp Gin Gly Asp Asp Leu Glu Asn Cys He Leu Arg 
20 25 30 

Phe Ser Asp Leu Asp Leu Lys Asp Met Ser Leu He Asn Pro Ser Ser 
35 40 45 

Ser Leu Lys Ala Glu Leu Asp Gly Ser Thr Lys Lys Lys Tyr Ser Phe 
50 55 60 

Ala Lys Lys Lys Ala Phe Ala Leu Phe Val Lys Thr Lys Glu Val Pro 
65 70 75 ' 80 

Thr Lys Arg Ser Phe Glu Cys Lys Glu Lys Leu Trp Lys Cys Cys Arg 
85 90 95 

Gin Leu Phe Thr Asp Gin Thr Ser He His Arg His Val Ala Thr Gin 
100 105 no 

His Ala Asp Glu He Tyr His Gin Thr Ala Ser He Leu Lys Gin Leu 
115 120 125 

Ala Val Thr Leu Ser Thr Ser Lys Ser Leu Ser Ser Ala Asp Glu Lys 
130 135 140 

Asn Pro Leu Lys Glu Cys Leu Pro His Ser His Asp Val Ser Ala Trp 
145 150 155 16 o 

Leu Pro Asp He Ser Cys Phe Asn Pro Asp Glu Leu He Ser Gly Gin 
165 170 175 

Gly Ser Glu Glu Gly Glu Val Leu Leu Tyr Tyr Cys Tyr His Asp Leu 
180 185 190 

Glu Asp Pro Gin Trp He Cys Ala Trp Gin Thr Ala Leu Cys Gin His 
195 200 205 

Leu His Leu Thr Gly Lys He Arg He Ala Ala Glu Gly He Asn Glv 
210 215 220 

Thr Val Gly Gly Ser Lys Leu Ala Thr Arg Leu Tyr Val Glu Val Met 
225 230 235 240 

Leu Ser Phe Pro Leu Phe Lys Asp Asp Leu Cys Lys Asp Asp Phe Lys 
245 250 ' 255 

Thr Ser Lys Gly Gly Ala His Cys Phe Pro Glu Leu Arg Val Gly Val 
260 265 270 

Phe Glu Glu He Val Pro Met Gly He Ser Pro Lys Lys He Ser Tyr 
275 280 285 

Lys Lys Pro Gly He His Leu Ser Pro Gly Glu Phe His Lys Glu Val 
290 295 300 

Glu Lys Phe Leu Ser Gin Ala Asn Gin Glu Gin Ser Asp Thr He Leu 
305 310 315 320 

Leu Asp Cys Arg Asn Phe Tyr Glu Ser Lys He Gly Arg Phe Gin Gly 
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325 330 335 

Cys Leu Ala Pro Asp lie Arg Lys Phe Ser Tyr Phe Pro Ser Tyr Val 
340 345 350 

Asp Lys Asn Leu Glu Leu Phe Arg Glu Lys Arg Val Leu Met Tyr Cys 
355 360 365 

Thr Gly Gly He Arg Cys Glu Arg Gly Ser Ala Tyr Leu Lys Ala Lvs 
370 375 380 

Gly Val Cys Lys Glu Val Phe Gin Leu Lys Gly Gly He His Lvs Tvr 
385 390 395 Y 400 

Leu Glu Glu Phe Pro Asp Gly Phe Tyr Lys Gly Lys Leu Phe Val Phe 
405 410 415 

Asp Glu Arg Tyr Ala Leu Ser Tyr Asn Ser Asp Val Val Ser Glu Cys 
420 425 430 

Ser Tyr Cys Gly Ala Arg Trp Asp Gin Tyr Lys Leu Cys Ser Thr Pro 
435 440 445 

Gin Cys Arg Gin Leu Val Leu Thr Cys Pro Ala Cys Gin Gly Gin Glv 
450 455 460 

Phe Thr Ala Cys Cys Val Thr Cys Gin Asp Lys Gly Ser Arg Lvs Val 
465 470 475 ^ 480 

Ser Gly Pro Met Gin Asp Ser Phe Lys Glu Glu Cys Glu Cys Thr Ala 
485 490 495 

Arg Arg Pro Arg He Pro Arg Glu Leu Leu Gin His Val Arg Gin Pro 
500 505 510 

Val Ser Pro Glu Pro Gly Pro Asp Ala Asp Glu Asp Gly Pro Val Leu 
515 520 525 

Met 



<210> 18 

<211> 555 

<212> DNA 

<213> Homo sapiens 

<400> 18 

aaggcgcgcg gatgggccgg cacgcaggcg cactagctcg ccacggcccc ggaagcgcag 

gagaggccgc cgggtggggc taggcgctgc gacagccggc gtgaggaagc tcagtgggct 120 

acgaacgtct ggcacacatg caaccgcccc ctcgggctgc ctccgcctgc cggctacttc 180 

tttctcccgc cttccgctca tgtccttccg gggccacgca gaaagtgccg ccgctttggc 240 

cactcagagc ccccgggccg cggtcgtcgt acgcctgaag gcgggtcgtg ccggcggccg 300 

ctctagtctc cgcctccgct caggccggtc ctccggggct tctcaatggt ttcccggtgg 360 

cctctcaatg gttttcccgg cggcccttgc gccgacgcca ggagacttcc ggagcttggt 420 

gacgtcacga gcgagctttt ctacccaaat acgcggcggg ggaataggct cgagggcggt 480 

gagcagtgac aattgctagg cggagacagt gcagggaaga gagaccttag aaaggatcag 540 

gactggcggg tatgt y * y 55 5 

<210> 19 
<211> 237 
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<212> DNA 

<213> Homo sapiens 

<400> 19 

cttatttata ttctaggagg tatttaactg aaaggaatat ctgcttcact gttgcaacca 60 

aaccagatgc cttcttccac ttcaccagac caaggagatg acctggagaa ctgcatttta 120 

agattttctg acctggattt aaaagatatg agtcttatta atcccagcag cagtcttaaa 180 

gcagaattag atggcagtac aaaaaagaaa tactcgtttg caaagaaaaa ggtagaa 237 



<210> 20 

<211> 339 

<212> DNA 

<213> Homo sapiens 

<400> 20 

tgaaatctaa ttgcaggcct ttgccctttt tgtcaaaacc aaagaagttc caacaaaaag 60 

gagttttgaa tgtaaagaaa 1 aattgtggaa atgctgtcgg cagctattca cagaccaaac 120 

cagcatccat agacatgtgg caacacaaca tgctgatgaa atttatcacc agacagcttc 180 

tattttaaag caactggctg tgacattgag cacctcaaag agtctttcgt ctgcagatga 240 

aaagaaccct ttaaaagagt gccttccaca tagccatgac gtgtctgctt ggctccctga 300 

tataagctgc tttaaccctg atgagctgat aaggtaaga 339 



<210> 21 

<211> 143 

<212> DNA 

<213> Homo sapiens 

<400> 21 

gattttcatt ttatagtggc cagggcagtg aagaagggga ggtgctcctt tattactgct 60 

accatgacct ggaggatccc caatggatct gtgcctggca gacagctctg tgtcagcacc 120 

tgcacctcac aggcaaggta aca ~ ^ 



<210> 22 

<211> 14 8 

<212> DNA 

<213> Homo sapiens 



<400> 22 

ccgtcttgtg tctcagattc gaattgctgc agaaggaatc aatgggacag ttggtggaag 60 

caaattggct accagacttt atgtggaagt catgctttcc ttcccattgt ttaaggatga 120 

cctgtgtaaa gatgatttta aggtaaga 149 



<210> 23 

<211> 128 

<212> DNA 

<213> Homo sapiens 

<400> 23 

gtttctcatt ggctagacca gcaaaggagg agctcactgt tttccagaat tgcgtgttgg 60 

tgtatttgaa gaaatcgtgc ccatggggat cagccccaaa aagatctcct acaagaagcc 120 

tggtatgc 128 

<210> 24 

<211> 141 

<212> DNA 

<213> Homo sapiens 
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<400> 24 

tttggtttgg ttttaggaat ccatttatcc ccaggtgaat ttcataaaga agtagaaaag 60 

tttttatctc aggcaaatca agaacaaagt gatactatcc ttcttgattg cagaaacttc 120 

tatgaaagca aaatagtaag t 141 



<210> 25 

<211> 181 

<212> DNA 

<213> Homo sapiens 



<400> 25 

tgctcctatg ttacagggac gattccaagg ctgcttagcc ccagacatca ggaaattcag 60 

ttacttccct agctacgttg acaaaaatct agaacttttc agagagaaga gagtgctgat 120 

gtactgtacc gggggcatcc gctgtgagcg gggttcagcc tacctcaaag ccaaggtgag 180 

c ^ 181 



<210> 26 

<211> 161 

<212> DNA 

<213> Homo sapiens 



<400> 26 

gtttttccac acctagggag tgtgcaagga ggtgttccag ctcaagggtg gcatccacaa 60 

gtacctggaa gagtttcctg atggctttta caaagggaag ttgtttgttt ttgatgaacg 120 

ctatgctctg tcctacaaca gtgatgtggt gtcaggtagg t 161 



<210> 27 

<211> 2697 

<212> DNA 

<213> Homo sapiens 



<400> 27 

tttccttcct ccccagagtg ttcatactgt ggagcccgct gggaccagta taaactctgc 60 

tctactcccc agtgccgcca gctcgttttg acctgccctg cctgtcaagg acaaggattc 120 

acagcctgtt gtgtcacatg tcaagacaag gggagcagga aagtttcagg ccctatgcaa 180 

gacagcttta aagaggaatg cgagtgcaca gcccgacggc cacgcatacc tagggaactc 240 

ttgcagcatg tgcgacagcc tgtgagccca gagccagggc ctgatgctga tgaggatggg 300 

ccagtgctta tgtgagcagc acctttggca ttttcccagg ccctcggtaa aagtaggttt 360 

ggggtgacta tacagagaaa gcatggcaag actgcagaaa cagagaaatc gggaacttca 420 

gttctggccg ctgccaccgt ggcagccgtc tacacttcac agcgggaggg gaggagtcac 480 

gttgtctacc acttacctga gacattctga tttggatgat gctagagcac agaaaatagg 540 

tgagctgcat gggatcccaa agctgctgag ggatagagcc tgagcctggt ggccacagca 600 

tatgcccttt ctgttccatg cagctggggc tgttagtagt cattgccctt gtcagcagac 660 

cttctaccct ggtggcaaac acatgaaagc tgtggccctg ggagtggcct cctaaaacaa 720 

gccacttagg tcatctgcca tctaccctta acctctgtct ctcgcctgag gggaatctgc 780 

aagctgtgca ttgggcttac ctcctgcttt tgtagaaata accatccttt ggtatacatg 84 0 

gaggatagtt ccagaacgcc tgagtataca aaaacccaafc gcatactcaa gtcccacagt 900 

gggccctaca gaacccacgt atgtgataaa tcagccctcc atgtacgcag gtttcgcccc 960 

ctgccaatac tgtattttca acctgtatgg ttgaaaaaaa tccatatata agtgcagcca 1020 

tgcagttcaa acccatattg ttcaagggtc aactgtatag tttattgaac agccacaccc 1080 

attcctttac acatgatcta tggcagagtt gaatagttgc aacagacact atgtggcctg 1140 

caaaatcgga aatttttact gtctggcctt ttacagaaaa gtttgccagc ccctgatcta 1200 

gaccagcagc tcatctgata gaggcagagg tggccttaaa gatgtggcct tcttcatttt 1260 

ctgttggttt ggtttcgttt ctatgagaga tttcctctga tagctctgct ttccccagca 1320 

cttactctct gagcttttaa atgttctctc tgggagcttc atataagctc ggtgacattt 1380 

gagccacagt ttttagatca gcacctggaa tacatgacac attcttactg aggtcatcca 144 0 

gcactgccat ggtggctgcc cagtcttctg gccagtgtgc caggcacatg tccctgtcac 1500 
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acaggttcca 
cagttttcag 
tggttgtccc 
ccatagcgct 
ttcagtgcgc 
ttgaccttgt 
caacaaacaa 
gtggcttaac 
agcagtacta 
aattccttga 
caggcgtaag 
aagctgggcc 
cttacctcac 
tggccgtaga 
ccagtattaa 
ttgcctggga 
tttcctatca 
agtggcttca 
atttggcaac 
cagttttatt 



aa^ecat 
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N 

agaa 
cgatgcagaa 
tgcaacatgg 
gaggccctgg 
aggccttcac 
cctttatcat 
aaatattagt 
acttttgctg 
atactacatt 
tatctggagt 
gtcagcagca 
attactaaat 
ataagccatt 
ttttcacatc 
attgtacata 
ttggtaatga 
tttctgcata 
gtctgtctat 
tttaatcgag 
tgctaatagc 



acgcagccat 
tgcagctgca 
tgccactcct 
ccccgcctcc 
cttttcatca 
gagatttgct 
ttaactggca 
gatcccaaga 
tcagtgtaat 
ttcttcacca 
atttgtctaa 
accatagcca 
tgcccactgt 
catcataagg 
aatcccaaat 
aaggttggtt 
tccagcaaaa 
ctgttctgag 
gttttcaaaa 
taagacaaat 



gcatagacca 
attgtgtttt 
gggccatgtg 
agttaccctg 
ccagcctctc 
gaaatcacta 
ctgtggtata 
gacgcacatg 
catcttgggg 
ttgtccggca 
taagtgatga 
tgttgatctg 
gcaatataga 
ttggcattca 
gttcttaaag 
tttgaaactt 
ggaatctcat 
gggaaaattg 
ttccaaggag 
ttgtaataaa 




acagatttaa 
aaggagaagc 
cagcctcagt 
tactgcccac 
tgctcagtgc 
atgaaaataa 
ttaaaaggca 
atgttaaaaa 
tggtttggcc 
tcctgcggag 
gatcagtagc 
gaaatttatc 
aaggtgtttt 
ataaggaaaa 
aacactcagg 
gaaatttcac 
gttgactcct 
tgttctggat 
ggttaataaa 
gtgttttata 



tattatattg 
caaatgggga 
ggacactctt 
tgccttacag 
tctggagttc 
ctcccaaaag 
caagggcatt 
gagatctggc 
aggatttccc 
gcttaatata 
tgaagtctct 
cctctagtgt 
caaaagtatt 
agttctaact 
gacatgtttg 
cattggtttt 
ggcagagttc 
ccagtaatca 
gaatgataat 
atacttc 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2697 



<210> 28 

<211> 300 

<212> DNA 

<213> Homo sapiens 

<400> 28 

gtgcttttct ctttaggtta cgagacagta caatagaagg agtatgctcg tccccattct 60 

ttcactgagt caccatatga ttttggacca gctagtgctc tagacctcag tatcccttct 120 

tataaaataa gaatgttaca gctcatgcaa tctgggactc caaatcttgg acatattagc 180 

tcacttgaga gaccaccagc ctggtcagca gatcactgtg tttttagtaa atctggaatt 240 

gtaagattaa cacttcatac cacatggggg aataaagttg ttgctctcac aggtgggctg 300 



<210> 29 

<211> 813 

<212> DNA 

<213> Homo sapiens 

<400> 29 

gtttttccac acctagggag tgtgcaagga ggtgttccag ctcaagggtg gcatccacaa 60 

gtacctggaa gagtttcctg atggctttta caaagggaag ttgtttgttt ttgatgaacg 120 

ctatgctctg tcctacaaca gtgatgtggt gtcaggtagg tcagcacagg ctcagagccc 180 

aaactgaaat gaagcacatt gtcagttcac tattctagaa aaatgacaca gggaagacag 240 

gccagtgctc attactgagc actgaataag cagggaaaat aagtacattg tgccaccatt 300 

ttcccagctg tggagctgag agaaccctag cccaggagtc aggaggcctg ggttgggatc 360 

ctggcttcac cattgctagc tggacaagcc cattaacatg gggatcatct cacctgccct 420 

gcctgcctgt ctacctgcca agagctgtac tactgggcta attcagggct cttaacctgg 480 

aattggtaca tagatttcag ggattctgtg aatttggatg gaaaaataat tgtatctttg 540 

ttttcaataa cacctcacta aaatgaagca tttcctttag ttatgaatgt aggcaacaaa 600 

gtaccagttg tattaatgta cctgtgactt tgtcttcagt aggattcaca atactttcat 660 

atcatgttct agttgcctca gatatctcaa aatagtattt atactcatca ctgcttcaaa 720 

atgaaaatag ttattaggcc caccactaag agttgatata taatgtgtta ataaatggca 780 

cgtcttatta tatattacag attttgaaaa aga 813 



<210> 30 

<211> 5 

<212> DNA 

<213> Artificial 
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<400> 
ctttga 



30 



5 



<210> 31 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> expression primer 

<400> 31 

ccaagggagt gtgcaagg 18 

<210> 32 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> expression primer 



<400> 32 

cctttgtaaa agccatcagg aaact 
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